Gene Caul_2137 details


Gene Information

Locus tag: Caul_2137
Symbol:
ID: 5899592
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2308396
End bp: 2311437
Gene Length: 3042 bp
Protein Length: 1013 aa
Translation table: 11
GC content: 64%
IMG OID: 641562626
Product: TonB-dependent receptor
Protein accession: YP_001683763
Protein GI: 167646100
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01782] TonB-dependent receptor
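
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both are assumptions, not statements from the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2308396
end_bp = 2311437
gene_length_bp = 3042
protein_length_aa = 1013


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, 2311437 - 2308396 + 1 gives 3042 bp, and 3042 / 3 - 1 gives 1013 aa, which agrees with the listed values.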


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.255977
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.143212
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACCAAT TCGCACACGC GCGGCGCGGC ATGTTGCTGG CGGCCGTTTC GGGACTGGCG 
ATCGCCGGGC CAAGCTTTGC CCAGGCTCAG GACGCGGTGG CGTCGGGCCA GACCACCGAG
AGCGCCTCGA CGGTCGACGA GATCATCGTG ACCGGCATCC GTGCTTCGCA ACAGCGCGCG
GTGAGCATCA AGCGCGAGGC CGCCTCGGTC GTCGACGCCA TCTCGGCCGA AGACATCGGC
AAGCTGCCCG ACAACACCAT CTCCGACTCG CTGCAACGGA TCCCGGGCGT TCAGATCCTG
CGCAGCGCCG GCGAGGGCTC GACCGTCAAT ATTCGCGGTC TGCCGCAGGT CTCCACGCTG
CTGAACGGCG AAGTCTATCT GGGCGCCCAG TCGATCACGA CCGTCCAGCC CAACTTCAAC
GACATCCCCT CGCAGCTGTT TTCGGGTGCG GACGTCATCA AATCGACGAC CGGCGACCAA
CTGAACGCCG GGATCAGCGG TACGATCAAC CTGCGCACGC GCCGGCCCAT GGACCTGAAG
GAGGGCCTAA CCCTCGCGGC CGCGGCCGAA GGCTCCTACG GCGACAAGAC CCAGAAGTTC
GATCCGAACG TCAACGGCCT GATCTCCTTT CACAACGACC GGTTCGGCGC CCTGTTGTCG
GCCGCCTATT CCGACGTGCG CCTGTCGAAC TCGCACAACG GCATCCAGGA AACCTACGGC
GCGACACTGC ATAACGAGAG CACGGCCGAC GCGACCTCCA GCGGCGGCTT CTCGCCGACC
AAACGCCCGC ACGGCACGCC GGTGGCCGGC GGCATCGACG TCAACGGCGA CGGCGACGCC
AACGACGCCT TCATTGTTCC CCAGGGCTTC ACCGGCTGGA ACAAGATCAA CCAGCGCGAA
CGCCTGGGCG TCAACGCCTC GGCCCAGTGG AAGATCAACG ACGCCCTGGA GCTGACCGGC
GACGCCTTCT TCACCAAGCA GGACGAGCAC GACCGCACGG CCGGCTTCCA GATGCAGGAC
GTCAACTGGC AGGCCGCGGA ATTCACGCCG GGCCAGTCGC GTGACACCGG GACGATCGTC
AACAGCTATC ACTTCAACAC CACCCAGATT TACAACTACG ACCTGGGCAA CTTCGACTCC
TACGCCCAGA CCGACCGCTA TCAATCCAAG TCGCAGAACT ACAATCTGGA ACTGAAGTAC
GATAACGGCG GCAAGTTCAC CGGGTCGGTG CGCGGCATCT ACGGGAAGGC CCACCAGGAC
TACGATCAGA GCTATCTGCA GTTCAGCCTG TCGAACGGCG CCCAATGGCA GCCGGGCGGC
GTGGGCCACT ATCCCGCTTC GCTGGGCGGC GATCGGCCGT TCAACACCGG CGGCTATGCG
GTCAACACCA TCGCCGGCGC GGCCTCCTTG CCCGCCAAGG TGGACTATAC CGGCAACAAG
CCGGTCTTCA CCCTGCCCAG CCAATTGCTG ACTGAACTGG GCGACATCAA CAGTTATGCG
CTCAAGACCA TTTCCTCGGA AGGCAACTAC CGCCGCGAAG GCGACCTGAA GGTCATCCGA
GCCGACGGCA AGTACGAGTT CAACGACAGC TTCAAGCTGT CGGCCGGCGC GCGCTATTCC
GAGCGCTCGG TCGACGACTT CGAGTTCGAT CGCGCCGCCC CGCTCTACGG CAGCGCCGCA
TCGAACGGCA CGGGCTGCCT GGTCAAGTGG AAGGCCTTCG ACGTCCCGCT CAGCGACAGC
AGCTGCAGCG CCGGCAACGC CGCGGGCTTC TACACCGCCG GTCTGACCCG CAAGGCCAAT
GACCCGACCC TGAACGGTGA AGTCAAGCTG TTCAACCCCG GCGTCGCGGG CGTGCCGTCG
ATGTACGTGC TCGACCCGAA GGCCATGGAC CACGCCCTGG CGTTCCAGAA CCGCTTCTAT
CCGGGCAATG TCGAGATCAT GAACCCGGGC GCCTCGTTCA ATGTCGGCGT CAAGCAGACC
TCGGCCTATC TGCAAGCCGA CTTCAAGGGT GAAGTCTTCG GCCTGGGCTT CACCGGCAAC
GCCGGCGTCA AGGTCATCCA GACCAAGCTC GACATCACCC AGTACGTCAC CGGCAGCCCG
CGCCCCTACG GCGTGGCCAA CCTGCTGGCC GGCAGCGTCG AGACCAACCG CAAGTTCACC
GACGTCCTGC CGGCGATGAA CGTCGCCTTC GATGTCGCCG AGAACGTCAA GCTGCGCTTC
GCCGCTTCCG AGACCATGAC GCTGCTGGAT CTGAACCAGT GGGGCGGCGG TCTGAACCCG
ACCTACGCCA TCGACACCAC CAATCCCGGT TCGCCGGTGT TCCGCGTCAC CGGCGGCAGC
CAGAACGGCA ACCCCGCGCT CGATCCCTGG CGCGCCAAGA ACTTCGAAGG CTCGCTGGAG
TATTATCTCG GCAGCGCCAG CATGCTGAGC GTCGGCGCCT TTTACATGAA GGTCGACAGC
TTCATTCAGA ACGGCTCGAT CGTCCGCACC GACCTGCCCG ACAACGACGG GGTGGTGCGC
AACCGCACCG TCTCGATCAG CACCCAGGTG CAAGGCGACG GCGGTACGCT GAAGGGTCTG
GAAGCCGGCG CCAAGCTGGC CTTCAACGAC CTGTCGTTCA TGCCCGCGAT GCTGTCGAAC
TTCGGCGTCG ACACCAACTT CACCTACGCG CCGTCGAAGT CGGGCAAGAA AGATCTGGCC
GGGGCCTCGA TCCCCTTCCA GGACAACTCG AAGTACCAGG CCAACCTCGC GGCCTACTAT
CAGGACGACA GGCTGCAGGC CCGGATCGCC TGGAACTACC GCTCCCGCCG CGCCGTGTCT
CAAGACTTCG GCGGAACCAC GGGACTGGAA ATGTACCAGG CCTCGACCAA CTATCTCGAC
GCCTCGGTCA GCTACGACGT CAAGCCGAAC CTGACCGTCT ACGTCCAGGG CACCAACCTG
ACCAGCGAGT ACGAGAAGTA CTACCTCACC TGGAAGGACG AGCACGCCTA CAACAACGTG
TTCGAGGCCC GCTACGTGGC TGGCGTCCGC TTCAAGTATT GA
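
The GC content reported in the Gene Information section can be recomputed from the gene sequence. This is a minimal sketch: `gc_content` is a hypothetical helper, and it assumes the sequence is pasted as shown here, in blocks of ten bases separated by spaces and line breaks.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters are ignored, so the blocked
    layout used in this record can be pasted in unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# Small illustrative input, not taken from the gene: 4 of 6 bases are G or C.
example = "GGCC AT"
print(f"{gc_content(example):.0%}")
```

Passing the full gene sequence to `gc_content` yields a fraction that can be compared with the 64% listed in the record.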
 
Protein sequence
MNQFAHARRG MLLAAVSGLA IAGPSFAQAQ DAVASGQTTE SASTVDEIIV TGIRASQQRA 
VSIKREAASV VDAISAEDIG KLPDNTISDS LQRIPGVQIL RSAGEGSTVN IRGLPQVSTL
LNGEVYLGAQ SITTVQPNFN DIPSQLFSGA DVIKSTTGDQ LNAGISGTIN LRTRRPMDLK
EGLTLAAAAE GSYGDKTQKF DPNVNGLISF HNDRFGALLS AAYSDVRLSN SHNGIQETYG
ATLHNESTAD ATSSGGFSPT KRPHGTPVAG GIDVNGDGDA NDAFIVPQGF TGWNKINQRE
RLGVNASAQW KINDALELTG DAFFTKQDEH DRTAGFQMQD VNWQAAEFTP GQSRDTGTIV
NSYHFNTTQI YNYDLGNFDS YAQTDRYQSK SQNYNLELKY DNGGKFTGSV RGIYGKAHQD
YDQSYLQFSL SNGAQWQPGG VGHYPASLGG DRPFNTGGYA VNTIAGAASL PAKVDYTGNK
PVFTLPSQLL TELGDINSYA LKTISSEGNY RREGDLKVIR ADGKYEFNDS FKLSAGARYS
ERSVDDFEFD RAAPLYGSAA SNGTGCLVKW KAFDVPLSDS SCSAGNAAGF YTAGLTRKAN
DPTLNGEVKL FNPGVAGVPS MYVLDPKAMD HALAFQNRFY PGNVEIMNPG ASFNVGVKQT
SAYLQADFKG EVFGLGFTGN AGVKVIQTKL DITQYVTGSP RPYGVANLLA GSVETNRKFT
DVLPAMNVAF DVAENVKLRF AASETMTLLD LNQWGGGLNP TYAIDTTNPG SPVFRVTGGS
QNGNPALDPW RAKNFEGSLE YYLGSASMLS VGAFYMKVDS FIQNGSIVRT DLPDNDGVVR
NRTVSISTQV QGDGGTLKGL EAGAKLAFND LSFMPAMLSN FGVDTNFTYA PSKSGKKDLA
GASIPFQDNS KYQANLAAYY QDDRLQARIA WNYRSRRAVS QDFGGTTGLE MYQASTNYLD
ASVSYDVKPN LTVYVQGTNL TSEYEKYYLT WKDEHAYNNV FEARYVAGVR FKY
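
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The sketch below shows one way to reproduce that mapping. It assumes that table 11 uses the standard codon-to-amino-acid assignments and that an alternative start codon such as the leading GTG is read as methionine at the first position. The names `CODON_TABLE`, `START_CODONS`, and `translate_cds` are illustrative and do not come from any particular library.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed initiation codons for table 11; each is read as M when first.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is translated as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGAACCAAT TCGCACACGC GCGGCGCGGC"))  # MNQFAHARRG
```

The output matches the first ten residues of the protein sequence above. The same function applied to the final sequence line, which ends in the TGA stop codon, gives FEARYVAGVRFKY.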