Gene Caul_2198 details

Gene Information

Locus tag: Caul_2198
Symbol:
ID: 5899653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2390305
End bp: 2391828
Gene Length: 1524 bp
Protein Length: 507 aa
Translation table: 11
GC content: 66%
IMG OID: 641562690
Product: hypothetical protein
Protein accession: YP_001683824
Protein GI: 167646161
COG category: [S] Function unknown
COG ID: [COG2308] Uncharacterized conserved protein
TIGRFAM ID:
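
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the coding sequence ends in a single stop codon; the record does not state either convention, and the function names are hypothetical.

```python
# Sketch: consistency check for the coordinates and lengths in the record.
# Assumption (not stated in the record): Start bp and End bp are 1-based
# and inclusive, and the CDS ends with exactly one stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Number of bases spanned by an inclusive coordinate pair."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2390305, 2391828)
    print(length, protein_length(length))  # prints: 1524 507
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.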


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.738171
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.508584
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACACGG GCGTGGCGGT GAAAATCAAG GACATCGACG ATACTCCCAG CCTGCCGATG 
ACCGGGGCGG CCTACCTACC AGGCGTCGCC TATGACGAGA TGATCGCCAG GGGCGGCGAC
GTGCGGACCC ACTACGCGGC GTTGCAAAGC CGGATCTCCA CGCTGGGGGC CGGCGAGCTG
GCCGACCGCC AGCGCATGCT CGAACGGTCC TTCCTGCTGC AGGGCATCAC CTTCACGGTC
TACGGCGCCG ACAGCGCCAC GGAACGGATC ATCCCGACCG ACCTGTTCCC CCGCATCCTG
CCGGCCCAGG AGTGGGCCAA GATCGAGGCC GGTCTCATCC AGCGTCTGCA GGCGCTGAAC
ATGTTCCTGG CCGACATCTA TGGCGAGCAA CAGATCCTGA TGGACGGGGT CGTGCCGCGC
GAACTGGTGC TGGGCGCGCC CTCCTACCGG CGCGAGATGC AGAACGTCTA CGTACCGCAC
AAGTCCTACG CCAACGTCTG CGGCAGCGAC CTGATCCGGG GGCAGGACGG CGAGTTCGCC
GTGCTGGAGG ACAATCTGCG GGTGCCGTCC GGCGTCTCCT ACATGCTGGC CAATCGCGAC
GCCTCCAAGC GCACCTTCCC GGGCGCCTAT CGCGAGGCCG GCGTGCGACC GGTCGAGCGC
TATCCCGACT TGCTGCTGGC GACGCTCAAG AGCATGAGCG CCGACTGGCG GTCCGATCCT
CAGGTCGTGG TGCTGACCCC CGGGGTCTAT AATTCGGCCT ATTACGAGCA CGCCTATCTG
GCGCGACTGA TGGGCGTGCC GCTGGTCGAG GGTCGCGATC TCGTGGTCCA TGACAACATG
GTCTACATGC GCACCACCAC CGGCCTGCGC CGGGTGGACG TGATCTACCG CCGGGTCGAC
GACGACTTCA TCGACCCCCT CGCCTTCCGC CGCGATTCGT CGCTGGGTGC CGCGGGTCTC
TTCAACGCCT ACCGGGCCGG CAATGTTGTC ATCTGCAACG CGCCGGGCAC CGGGGTCGCC
GACGACAAGG CGGTCTACGC CTTCGTGCCC GACATCATCC GCTACTATCT GGGCGAGGAC
GCCATCCTGC CCAATATCGA GACCTTCCTG TGCCGCGAAC CGGCGCAGTT GAGCCATGTG
CTGGCCAATC TCGACAAGTT GGTGGTCAAG GCCGTGGGCG CGTCCGGCGG CTACGGCATG
CTGATTGGGC CGCACGCCTC GGCCAAAGAG CGATCCGAGT TCGCCGACGC CCTGACGGCC
GATCCCGCCA ACTATATCGC CCAGCCGACC ATCCAACTAT CGACCGCCCC GTGCCTGGTC
GATGGCCGGA TAGAGCCGCG GCACGTCGAC CTGCGGCCGT TCATCCTGTC GGGCGAGAAG
ACCGTTGTCA CACCTGGCGC CCTGACCCGC GTAGCCCTGA AACGCGGCTC CCTGGTGGTC
AATTCCAGCC AAGGCGGAGG CTCGAAGGAC ACCTGGGTGC TCTCCGAAGA GTCGCCGTCG
AACGGTGCGG GAGGACTGGC ATGA
 
Protein sequence
MDTGVAVKIK DIDDTPSLPM TGAAYLPGVA YDEMIARGGD VRTHYAALQS RISTLGAGEL 
ADRQRMLERS FLLQGITFTV YGADSATERI IPTDLFPRIL PAQEWAKIEA GLIQRLQALN
MFLADIYGEQ QILMDGVVPR ELVLGAPSYR REMQNVYVPH KSYANVCGSD LIRGQDGEFA
VLEDNLRVPS GVSYMLANRD ASKRTFPGAY REAGVRPVER YPDLLLATLK SMSADWRSDP
QVVVLTPGVY NSAYYEHAYL ARLMGVPLVE GRDLVVHDNM VYMRTTTGLR RVDVIYRRVD
DDFIDPLAFR RDSSLGAAGL FNAYRAGNVV ICNAPGTGVA DDKAVYAFVP DIIRYYLGED
AILPNIETFL CREPAQLSHV LANLDKLVVK AVGASGGYGM LIGPHASAKE RSEFADALTA
DPANYIAQPT IQLSTAPCLV DGRIEPRHVD LRPFILSGEK TVVTPGALTR VALKRGSLVV
NSSQGGGSKD TWVLSEESPS NGAGGLA
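
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block-formatted gene sequence could be joined, summarised, and translated is shown below. It uses the NCBI translation table 11 codon assignments and start codons; the function names are hypothetical, and treating the first codon as methionine when it is a recognised start codon is an assumption about how the protein sequence was derived.

```python
# Sketch: join a block-formatted DNA sequence, compute GC fraction, and
# translate it with NCBI translation table 11 (bacterial).
# Codons are indexed in TCAG order, matching the NCBI table layout.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
TABLE_11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Assumption: a table 11 start codon in the first position is read as M.
    """
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in TABLE_11_STARTS:
            residue = "M"
        else:
            index = (16 * BASES.index(codon[0])
                     + 4 * BASES.index(codon[1])
                     + BASES.index(codon[2]))
            residue = AMINO_ACIDS[index]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    first_blocks = join_blocks("GTGGACACGG GCGTGGCGGT GAAAATCAAG")
    print(translate(first_blocks))  # prints: MDTGVAVKIK
```

The first thirty bases of the gene sequence translate to MDTGVAVKIK, which matches the start of the listed protein sequence; the GTG start codon accounts for the leading M.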