Gene Caul_2995 details

Gene Information

Locus tag: Caul_2995
Symbol:
ID: 5900450
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3254322
End bp: 3257123
Gene Length: 2802 bp
Protein Length: 933 aa
Translation table: 11
GC content: 62%
IMG OID: 641563492
Product: TonB-dependent receptor
Protein accession: YP_001684620
Protein GI: 167646957
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01782] TonB-dependent receptor
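
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (the record does not state either convention explicitly); the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3254322, 3257123)
print(length, protein_length(length))  # 2802 933
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.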


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGAAGC GATATCCCGT TCAGGCCATG CGCCCGAACC ACAAGATCTT TTTGCTGGCC 
ACCGTATCGG CGCTGGTGGT GGGCGCCAGC TCATCGGCCG CTCTGGCGCA GCAGGCCTCC
GGTGAGTCGG CGTCTGGCGA CATGGTCGAC CAGGTGGTCG TCACCGGCTA CCGGAAGTCG
CTGAGCGATG CGCGCGCCAT CAAGAGGGAT TCAGTCATCC AGAAGGACGC GATCGTCGCC
GAAGACATGG CGAAGTTCCC GGACCTGAAC CTGGCCGAAT CGTTGCAGCG CCTGCCGGGC
GTGCAGATCA CCCGCGAGGC GGGCGAGGGC AGGCGCATTT CGCTGCGCGG CCTCGGCCCG
GATTTCAGCC GCGTGCAGCT GAACGGCATG GAAGTGCTGG GCAATGTCGA CTCCGCCCAG
GACAGCCGTG GCCAGCGCTC GCGCGACCGC GCGTTCGACT TCAACATCTT CGCTTCGGAA
CTGTTCTCGA AGGTGGAGGT CGAGAAGACC TTTGAAGCGG CCCAGAACGA GGGCGGCATG
GCCGGCACCG TCGGCCTGTT CACCGGCAAG CCGTTCGACT ACGCGGCGGG ATCCAAGGGC
GCGGTGTCGC TGAAGCTGGG CACCAACGAG TACACCAAGG ACACCCAGCC GCGCATAGCC
GCCTTGTTCA GCCAGAACTG GGACAACAAG TTCGGCGTGG CGCTCTCGGT CGCCTACTCC
AAGCGCGAGA CCACCGAGCA GGGCCACAAT ACCTACAATT ACGACCGGTT AAGTTCTGCT
GCCTTGCAGA AGCTAGTCAC CAACGGCCTG AATATCTCCC ATCTGAGCGC CGCGCAGCAG
GCCAAGTTCC TGTCCGGGGA CCTGTATTTC GCGGACGGTA ACCGCATCTC CTCCTGGAAC
GCGAAGCAGG AGCGCCTCGG CCTGACCGGC GCTGTGCAGT GGCGGCCGAT GGACAATCTG
CTGTTGACGC TGGATGCGCT GCACGGCGAA TTCACCACCC ACCGCGACGA GTATCACCTG
GCCACGCGAC CGCTGGGATC CGGGACGAAG TCCTTTGCGT TCGACACGCC CGCCGGCGGG
GTCTGGCCGG CGGCCTTCCA GACGGGGTCG GTCATCAACG ACCTGACGTG GGATAGCAGC
AACTACGTCA CCAAGACCGA CGTCACCGGC ACGACCTTCG GCAGCGAGCA TCGCCGGTCG
CTGAACGAGA ACCGCTTCAA CCAACTGGCC CTGACCGGCA AGTGGGACGC GACCGATCGC
CTGACCATCG ACGGCCATGT CGGCTATGAA AAGTCGACCT ACAAGACCCC CTATGACGAC
AAGCTCTACA TGCGCGCCAA GGGCAATATG GTCGCCAACT ACGGTACGGA CGGCCAGTCG
GCCACGTTCA GCTATCCGGG CTTTAGCGCC ACCAACCCGG CCAACTACGC GATGGACTCG
TTCTACTACC GCAGCTTCAA CAATGAATCG GGGCTGCGCG AGGGCGTGCT GAACCTGCGC
TACGAACTGT CCGACGTCTT CACCCTGCGC GCGGGCGTGG CCTACCACCG CTTCTCGCAA
GAGGGCATGG ACCTGTTCTA CGACGACAAC GTCAATGGAA CCCGGTCCAA GATGCGCGGC
ACGTCCGTCG CCGACGTCAC TTCGGTATTC ACGAACGAAT TCGGATCGTG GCTGGTCGGC
GACTACGGCA AGGCCTTCGC GAAGTACAAG GAGTACCACC GGCTCGGGGC CAATACCGAC
GGGACGGGCG GGACGCTGCA GGACATCGAG AACGTCTACA AGACGTCCGA AGAGACGGTT
TCCGAATATG TGCAGGCCGA CTGGGACAGC GAACTGTTCG GCAAGCGCTT CCGCGGCAAT
ATCGGCCTGC GCGGCTACAG CACCGACACC CACAGCACCG GCTGGATCCA GGGCGACAGC
TACGCCTATC TCGGCACGAC CGACGTCAAG GGCAGCTATG AAGGCGTCCT GCCGGCCCTC
AACACCGTGC TGGACCTGAC GCCGGAAGTG CTGGTGCGCT TCTCCGCCAC CCAGAACCTG
AACCGTCCGA GCCTGGGTTC GATGGCGGCC AAGGGCAGCG CGTTCCAGAA TGATAGCGGC
GATATCAGCG CCTCTCGCGG CAATCCCGAC CTCAAGCCCT TCAAGGACAC CACGCTGGAC
CTGTCGCTGG AGTACTATTT CGGCAAGTCA GGCCTGCTTT CGGCGGGTGT GTTCCGCAAG
GACATAACCA ACTTCATCAC ATCGACGACC CTTCACAACA TCCCCTTCAG CCAGACGGGG
GTGCCCTACA CCACCATACC GGGCGCGACG GCCAGCACCA TCGTCAAGGA CTTCGATGTT
CCGACCAACA GTTCGGACAA GGTGAAGCTG ACCGGCGTTG AACTGGTGGC GCAAGGCCAG
TTCTCGTTCC TGCCAGCGCC CTTCGATAAT CTCGGCGGCG TGGCGAACTA TACCTATGTG
GACTCAAATT CGGATCTCAC TGGCATTTCC AAGTCCAGCT ACAATCTCAC CCTCTACTAT
GAAACCGACC GTTGGGGCGC CCGCGGCTCG GTGAGCCACC GCACCCGCTG GTACACCGGC
TATAACAAAG ATGTCATGAG CGCCGACACG CGAGGCTTCG AGGGGTCCAC CTATGTGGAC
GCTTCGGCCT TCTTCAATGT CACCGACAAG ATGCAAGTCT CGTTGAACGC GATCAATCTG
ACCAACCAGA AGGACACCCA GTTCTGGGGC CAGAACCGCT ATCTCTATAA TCAGAACCAG
AGCGGCCGGA CCTACATGAT GGGGCTCAGC TACAAGTTCT AA
 
Protein sequence
MSKRYPVQAM RPNHKIFLLA TVSALVVGAS SSAALAQQAS GESASGDMVD QVVVTGYRKS 
LSDARAIKRD SVIQKDAIVA EDMAKFPDLN LAESLQRLPG VQITREAGEG RRISLRGLGP
DFSRVQLNGM EVLGNVDSAQ DSRGQRSRDR AFDFNIFASE LFSKVEVEKT FEAAQNEGGM
AGTVGLFTGK PFDYAAGSKG AVSLKLGTNE YTKDTQPRIA ALFSQNWDNK FGVALSVAYS
KRETTEQGHN TYNYDRLSSA ALQKLVTNGL NISHLSAAQQ AKFLSGDLYF ADGNRISSWN
AKQERLGLTG AVQWRPMDNL LLTLDALHGE FTTHRDEYHL ATRPLGSGTK SFAFDTPAGG
VWPAAFQTGS VINDLTWDSS NYVTKTDVTG TTFGSEHRRS LNENRFNQLA LTGKWDATDR
LTIDGHVGYE KSTYKTPYDD KLYMRAKGNM VANYGTDGQS ATFSYPGFSA TNPANYAMDS
FYYRSFNNES GLREGVLNLR YELSDVFTLR AGVAYHRFSQ EGMDLFYDDN VNGTRSKMRG
TSVADVTSVF TNEFGSWLVG DYGKAFAKYK EYHRLGANTD GTGGTLQDIE NVYKTSEETV
SEYVQADWDS ELFGKRFRGN IGLRGYSTDT HSTGWIQGDS YAYLGTTDVK GSYEGVLPAL
NTVLDLTPEV LVRFSATQNL NRPSLGSMAA KGSAFQNDSG DISASRGNPD LKPFKDTTLD
LSLEYYFGKS GLLSAGVFRK DITNFITSTT LHNIPFSQTG VPYTTIPGAT ASTIVKDFDV
PTNSSDKVKL TGVELVAQGQ FSFLPAPFDN LGGVANYTYV DSNSDLTGIS KSSYNLTLYY
ETDRWGARGS VSHRTRWYTG YNKDVMSADT RGFEGSTYVD ASAFFNVTDK MQVSLNAINL
TNQKDTQFWG QNRYLYNQNQ SGRTYMMGLS YKF
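
Both sequences above are displayed in space-separated blocks of ten characters. The following is a minimal sketch of how such a display might be collapsed back into a contiguous string before further analysis, together with a simple GC-fraction helper; the function names are illustrative and are not part of the source database.

```python
def join_blocks(display: str) -> str:
    """Remove all whitespace from a block-formatted sequence display."""
    return "".join(display.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for empty input)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First display line of the gene sequence shown above.
first_line = "ATGTCGAAGC GATATCCCGT TCAGGCCATG CGCCCGAACC ACAAGATCTT TTTGCTGGCC"
dna = join_blocks(first_line)
print(len(dna))  # 60
```

Applying `join_blocks` to the full gene sequence and passing the result to `gc_fraction` would give a value comparable to the GC content field of the record.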