Gene Caul_1378 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1378 
Symbol 
ID5898833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1461778 
End bp1464396 
Gene Length2619 bp 
Protein Length872 aa 
Translation table11 
GC content65% 
IMG OID641561865 
ProductTonB-dependent receptor 
Protein accessionYP_001683006 
Protein GI167645343 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.571563 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAATC GTTCGCGAAA TATCCTGCTT GCAGGGATTT CGACCGTCAG CGTATTGTTG 
GCCTGTGGTG TGAATGCCTA TGCCGCTGAC GTGCCCGATG CGGCGCCCGA CGCGACCGCG
GAACTCGACA CGCTGATCGT CACGGCGCGC GGCAAGCCCC GTACGGTTCT GGACTCGGCC
GTTCCGGTCG ACTCCTTCAG CGAAGCGGAT CTGAAGGCTT CGACCTTCAC CGACACCAAC
GACATCCTCA AGACCCTGGT CCCGTCCTAC ACCCTGGCGC GCGAGCCGAT CTCGGACGGC
GCCACCTTCA TCCGCCCCGC GTCCTTGCGC GGCCTGCCCA CCGACAAGAC CCTGTTCATG
GTCAACGGCA AGCGCCGCCA CCGCTCGGCC CTGGTCAGCA TCGGCGGCAC CGGCGCCCAG
GCGCCCGACG CGGCGACCAT CCCCGCCTCG GCCCTGAAGA ACGTGGTCGT GACCCGTGAC
GGCGCCGGCG CGCAGTACGG CTCGGACGCC ATCGCCGGGG TCATCGACTT CCAGCTCAAG
GACAGCCCCA GCGGCGGTTC GCTGACCGCC CAATACGGCC AGTACTATCT GGGCGACGGC
GAGGACGTGC TGCTGACCGG CAACCTGGGC CTGCCGATCG GCGACAAGGG CTTCGTCAAC
GGCACCTTCG AATATACCAG CCAGAACCAG GTCAACCGCG GCAGGCAGTA CTGCAACAAG
GGCATCCCCA ACCAGTCGGC CGGCTTCTGC GTCGCCGACT ACGCGGCCAC CAACCCGGCC
TACGGCGCCC TGATCCACGA CTATGTGCAG AAGTGGGGTC AGCCCGACGC CGAAGCCACG
CGCGGCGTGA TCAACGCCGG CTACGAGTTC TCGGAAGCCG TGTCGGTCTA TGGCTTCGCC
AACTATTCCA AGAGCTCGGC CACCGAGTAC TTCAACTATC GTCCGCCGGT CAGCAACGCG
GTCAACGCCA CGCCGATCCG CAAGTCCGAC GGCAGCCTGT TCCAGTTCAG CTCGATCTTC
CCGGCCGGCT TCACGCCGAT GTTCGGCGGC GACATCACCG ACTACAGCCT GACGGGCGGC
TTCAAGGGCA AGTTCGCCAG CGGCCTGACC TATGACATCA GCGGCCGCTA CGGGAACGAC
AAGCTGTCCT ACACCCTGTG GGACACCGTG AACCCGTCGA TGGGGCCGGA CTCGCCGAAG
TCCTTCTATG TCGGCTCGCT GATCTCGACG GAAACGGCCG CCAACGCCGA CTTCGCCTAT
GACTGGGGTG TCGGCGGCTT CGCGACCCCG ATCACGATCA ACTTCGGCGC CGAATACCGC
AAGGAAGGCT ACGAGATCGA GGCGGGCGAC GTGCCGTCCT ACAAGGCCGG GACCTGGGCC
GTGCCCAATC CGTTCGGCTT CTGCGACACG ACAACTCATA CGCCGACCGC CGCCGCCACC
GCGACGGTTC TGGCCAATGG CCTCAACTGC GCCAACTACA AGTCGGACGC CACCGACGGC
TTCGCCGGCA TCGACCCGGT CTATAACGCC CTGGCCGTCG GCTCGAACGG CGCTCCGGGC
TCGCCGCCGG ACTATTCAGG CAAGCTGACC CGCGACTCCT ACGCCGGCTA TGTCGAAACC
TCCGCCAATA TCACCGAGGC CTGGTTCCTG GACCTGGCCC TGCGCGGCGA GCACTTCTCC
GACTTCGGCG GCACGGTGAA CGGCAAGGTC GCCACCCAGT ACCAGATCAA CGACAAGTTC
GGGATCCGGG GCTCGATCGG CACCGGCTTC CGCGCCCCGA CCCCGGGCCA GCTGTTCACG
ACCAACGTCT CCACGCGGGT CGAGAACGGC GCGATCATCG CCTCGGGCCT GTTCCCGGCC
ACCAACCCGG TGGCCCAGTT CATGGGCGCC AAGGAGCTGA AGCCCGAGAA GTCGCTGAAC
ATCGCGCTGG GCTTCACCGC CACGCCGATC GACAATCTGA GCCTGACGAT CGACGCCTAC
ACCATCCAGA TCGACGACCA GTTCTACACC ACCACGCCGA TCACGGTGAC GGACACGATC
CGAGCGAACC TGATCGCCAA CAACATCCCC GGCGCGTCCA CCATCGGCCA GGTGCAGTTC
TACCAGAACG CCTTCGACTC CACGACGACG GGCGTCGATG TCGTCGCCAC CTACAAGGTC
GATTGGGAGA ACGGCCAATC GACGGCCTTC ACCGCCAGCG GCAACTGGAA CAGCTTTGTC
ATCGACAAGG TGTTCTCCAA CAAGTTCTTC GACGGCGAAG GGGTCTACGA CTTCAAGCAC
GCCGCGCCGC GCTGGCGCTC GGTGCTGTCG GCCACGCACG AGGTCGGCAA GTTCAAGGCC
ATCGGTCGCC TGAACATCTG GGGTCCGTAC AAGAACATGT TCAGCGTGGC CAATCCGGTG
ATCCAGAAGT TTGACACCGA AGCCTTCGTC GACCTGGAGC TCAGCTACAA GGCCACCGAC
ACCTACACCG TGTCGCTGGG CGCCCGGAAC CTGTTCGCCA ACTACCCCGC CATTGACAAG
ACCGGTGAGT CGGCGACCAA CGGCCGTCTC TACCGCTCGG ACTCGATCGT CGATTGGCAG
GGCGGCTTCT GGTACCTGAA GGCCTCGGCG ACGTTCTAG
 
Protein sequence
MNNRSRNILL AGISTVSVLL ACGVNAYAAD VPDAAPDATA ELDTLIVTAR GKPRTVLDSA 
VPVDSFSEAD LKASTFTDTN DILKTLVPSY TLAREPISDG ATFIRPASLR GLPTDKTLFM
VNGKRRHRSA LVSIGGTGAQ APDAATIPAS ALKNVVVTRD GAGAQYGSDA IAGVIDFQLK
DSPSGGSLTA QYGQYYLGDG EDVLLTGNLG LPIGDKGFVN GTFEYTSQNQ VNRGRQYCNK
GIPNQSAGFC VADYAATNPA YGALIHDYVQ KWGQPDAEAT RGVINAGYEF SEAVSVYGFA
NYSKSSATEY FNYRPPVSNA VNATPIRKSD GSLFQFSSIF PAGFTPMFGG DITDYSLTGG
FKGKFASGLT YDISGRYGND KLSYTLWDTV NPSMGPDSPK SFYVGSLIST ETAANADFAY
DWGVGGFATP ITINFGAEYR KEGYEIEAGD VPSYKAGTWA VPNPFGFCDT TTHTPTAAAT
ATVLANGLNC ANYKSDATDG FAGIDPVYNA LAVGSNGAPG SPPDYSGKLT RDSYAGYVET
SANITEAWFL DLALRGEHFS DFGGTVNGKV ATQYQINDKF GIRGSIGTGF RAPTPGQLFT
TNVSTRVENG AIIASGLFPA TNPVAQFMGA KELKPEKSLN IALGFTATPI DNLSLTIDAY
TIQIDDQFYT TTPITVTDTI RANLIANNIP GASTIGQVQF YQNAFDSTTT GVDVVATYKV
DWENGQSTAF TASGNWNSFV IDKVFSNKFF DGEGVYDFKH AAPRWRSVLS ATHEVGKFKA
IGRLNIWGPY KNMFSVANPV IQKFDTEAFV DLELSYKATD TYTVSLGARN LFANYPAIDK
TGESATNGRL YRSDSIVDWQ GGFWYLKASA TF