Gene Caul_1967 details

Gene Information

Locus tag: Caul_1967
Symbol:
ID: 5899422
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2108530
End bp: 2111319
Gene Length: 2790 bp
Protein Length: 929 aa
Translation table: 11
GC content: 70%
IMG OID: 641562457
Product: TonB-dependent receptor plug
Protein accession: YP_001683594
Protein GI: 167645931
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
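
The coordinate and length fields in this record are mutually consistent, which a short check can confirm. The sketch below assumes that the start and end positions are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon; the record states neither convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2108530, 2111319)
print(length_bp, protein_length(length_bp))  # prints "2790 929"
```

Under these assumptions the computed values match the Gene Length (2790 bp) and Protein Length (929 aa) fields.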


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.559589
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGTTCT GGCGCGTCAC GGCTCACGGG ATCTGGGCGA CGGCGCTCCT GGCGGCGGCG 
CCGAGCCTAG CCCGCGACGT CGAACCGCGC CGCGCGTTCG ATCTGGAGGC CGCGCCGCTG
GGCAGGGCCT TGTTGGCGTT CAACCTGGCG ACCGGCGCCC AGGTCATCGT CTCGCGCGAC
CTGCTGGTCG GCCGCGTCTC GCGCCCGGTG CGCGGTCAAT ACACCGCCAC CCAGGCCCTG
GCGCGCATGC TGACCGGATC GGGTCTGCAT GCGGAGCGCA CGGGGCGCGG CGTCCTGATG
ATCCTACCCG ACCGGCCGCC GGCGGCTCCA AGGCCCAGCC CCAGCCCCAG CGCGCCCGAA
GCCGTCGCCA TCGCCGAGAT GGAGGGCGTG ACGGTCACCG CGCCCCGCAT CCTGCCGCCC
GTCGCTCTGA CCACGCTGAG CGGCGCGCGC CTGGAGCAAC TGGGCGTGGT TGGCATGGAT
CAGGTCGGGC GGCTGACGCC GGGCCTCAAT GTCGTCAATC TCTCCCCGGC CGCTGCGGCC
TTCTCGATGC GCGGCGTCAC CCAGGCCTCG GGCGACGCCA GCCGCGAGCC GAGAATGGCG
GTGCTGCAGG ACGGCATGCC AGCGTCCAAG GAGCGCGGCG CCTATTTCGA GCTCTTCGAT
CTTGACCGCA TCGACATCGC CAAGGGCCCT CAGCCCACGC TCTACGGCCG CGGGGCGATG
GCGGGGGCGC TCGACGTCAT CCAGCACAAG GCCGATCCGC GCGCCAAGGC CTGGAGCCTT
CATGGCGAGG TCGGCGACCA TGGCCAGGGC CTGCTGGACG CCATGGTCAA CCAACCGATC
GGCCAGGACC TAGCGGTGCG TGTCGCCGCT CGCTGGCGAG GACGAGACGG CCTGACGCCC
AACCTGGCTG GCGGCGACCG CCTGGACTCA GTCGCCACCG ACGCCGCCCG CGTCTCACTG
GCCTGGCGTC CCGAGGATGG CGAACGGCTT GACCTGATCG TCAACTACCA GCGGGACCGT
CCCACTGGCC GGCCGCTCAA ATCCTTCACC TTCGCGCCCA CCGACCCGGT CACGGGCGCG
GCTCTGGGCG ATCTTGACCG CCACACCCCG CCCGCCTTGA GCGCCGTCGA CGAGAACGGC
CGCCCCGCGG CGCTCGGCCT GGACCGTACG CTGGGCTCGG TCGCGGTCCT GGCCGCGCGC
CGGCCGAGCC CGGGTCTGAA CCTGAGCGCG TCTCTGGGCT ATCGCCGCTT CTACGCCGAC
GAGCGCCAGG ACGGCGATGG CCTGTCCCTG CCGGTGATCA GCGTCTTCGA GCAGACCCGG
GGCGTGCAGT ACAACGCCGA CCTGCGCCTG GCCTACGACG CCGGCGAGCG TTGGCGGGGT
TTCGCAGGCG TCAGCCTGTT CCACGAAAGC GGCACCCAGC GGGCCTATTT CACGATCGAC
GAACGCCTGC TCCTGGCGCG CATGGCCGGC GGCCTGGACA CGCCCGTCCC CCAAGGTCTC
GATACGCTGA CCCAGCCGGA CTTCGTGGCC GCGCAGCTGC GCCAACTGGC CGCGCGGCGC
GGCTTCGACC TCCCGGGCGA TCTGGCCCTG GGCGCGGCCG GCAACCTCAA GTCGGCCTAT
GTCGAGCGCA ACCAGAACTT CGCGCGGACC ACCGCCGTCG ATCTCTTCGG CGACCTCACC
TTCGCGCCAA GCCCGACCTG GGCGCTTTCG GCCGGCCTGC GCTACAGCCA CGACGCCAAG
GCCAGCGCGG TCGCCACCGC CCTGCCCGCC GGCCCGTCGG TGCTGGCGGG AGCCGTCCAG
GCCCTTGGCC TTCCCTGGGG GCAGAGGGCC GCCTTGCTCG CCGCGCTCAG CGCGCCAGGC
GCGGGCGCCA GCGCCGAGAC TCCCCGCGCC TGGCCCAACT ACGCCTTGGT GTTCCAACCG
ACATCGGGAA ATGGAGACAA GGTGTCGAAT GCGCTGTCCG ACGAGGGCTG GAGCGGGCGA
CTGGTGGCGC GCTATACGCC ATCGTCGGCG TTCAGCGCTT ACGGCTCCTA CGCCCGGGGG
CGACGGCCCA GCGTCCTGGT GGCCGGACCG CCTCTGGTGG CCGAGGGTCC GGCGCGGTTC
GGAATCGTGC CCGCCGAGAC GATCGACTCC CTGGAGATCG GAACCTGCGG CGTCGGTTTC
GACCAACGCG TGGATCTGAA CCTTGCCGCT TACGCCTATC GCTATGTCAA CTTCCAGACC
ACCAAGCTTG TCGATGGTCA ACTGCAGACA CTGAACGCCG GTCGAGCCGA CGCGACCGGT
CTGGAGGCCG AGGCGCGGTT CGAGCTGTCG CGCTCAGTCC GGCTGTTCGC GGCCTACGGC
TACAACCGTT CAAGACTGCG CAGCGGCGCC TTCGCTGGCA ATCAATTTCG CCTCGCCCCC
GATCACAAGG TCTCGCTCAA CCTGGAACTG ACATACGCCA TGCCGGGCGG CGCGCTCACG
GTCCTGCCGA CTTGGAGCTG GCGATCGAAG GTCTTCTTCT CCGACGACAA CGATCGCCCC
CAACTGTCCC GCGGACTAGT GCGCGATCTG GTGCAAGACG AGGTGCAAGG CAGCTACGAC
TTGCTGGATC TGCGGGTGAA CTATCAGCCG CGCGCCGCCA ACTGGAGCGC GGGCGTCTTT
GTCACCAATC TCCTGGATCG GCGCTATCTT CAGGAAGCAG GCTTCATCGG CGAAAGCTTC
GGATTCACAG CGTCGGCGCA GGGGACGTCG CGTCTCTGGG GGCTATCTTT ACGTATCGTG
AGTGGAGCGA GGCAGGACTT CGCGAAATAG
 
Protein sequence
MRFWRVTAHG IWATALLAAA PSLARDVEPR RAFDLEAAPL GRALLAFNLA TGAQVIVSRD 
LLVGRVSRPV RGQYTATQAL ARMLTGSGLH AERTGRGVLM ILPDRPPAAP RPSPSPSAPE
AVAIAEMEGV TVTAPRILPP VALTTLSGAR LEQLGVVGMD QVGRLTPGLN VVNLSPAAAA
FSMRGVTQAS GDASREPRMA VLQDGMPASK ERGAYFELFD LDRIDIAKGP QPTLYGRGAM
AGALDVIQHK ADPRAKAWSL HGEVGDHGQG LLDAMVNQPI GQDLAVRVAA RWRGRDGLTP
NLAGGDRLDS VATDAARVSL AWRPEDGERL DLIVNYQRDR PTGRPLKSFT FAPTDPVTGA
ALGDLDRHTP PALSAVDENG RPAALGLDRT LGSVAVLAAR RPSPGLNLSA SLGYRRFYAD
ERQDGDGLSL PVISVFEQTR GVQYNADLRL AYDAGERWRG FAGVSLFHES GTQRAYFTID
ERLLLARMAG GLDTPVPQGL DTLTQPDFVA AQLRQLAARR GFDLPGDLAL GAAGNLKSAY
VERNQNFART TAVDLFGDLT FAPSPTWALS AGLRYSHDAK ASAVATALPA GPSVLAGAVQ
ALGLPWGQRA ALLAALSAPG AGASAETPRA WPNYALVFQP TSGNGDKVSN ALSDEGWSGR
LVARYTPSSA FSAYGSYARG RRPSVLVAGP PLVAEGPARF GIVPAETIDS LEIGTCGVGF
DQRVDLNLAA YAYRYVNFQT TKLVDGQLQT LNAGRADATG LEAEARFELS RSVRLFAAYG
YNRSRLRSGA FAGNQFRLAP DHKVSLNLEL TYAMPGGALT VLPTWSWRSK VFFSDDNDRP
QLSRGLVRDL VQDEVQGSYD LLDLRVNYQP RAANWSAGVF VTNLLDRRYL QEAGFIGESF
GFTASAQGTS RLWGLSLRIV SGARQDFAK
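
The sequences above are printed in blocks of ten characters, so the whitespace has to be stripped before they can be used. The following sketch, which is not part of the original record, shows one way to clean such a block, compute its GC fraction, and translate it using the amino-acid assignments of translation table 11 (which match the standard code for sense and stop codons). The helper names `clean`, `gc_fraction` and `translate` are hypothetical, and only the first line of the gene sequence is used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI translation table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0 until the first stop codon.

    Raises KeyError for codons with characters outside A, C, G, T.
    """
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listed in this record.
first_line = "ATGAGGTTCT GGCGCGTCAC GGCTCACGGG ATCTGGGCGA CGGCGCTCCT GGCGGCGGCG"
dna = clean(first_line)
print(len(dna), translate(dna)[:10])  # prints "60 MRFWRVTAHG"
```

The translated residues agree with the start of the protein sequence listed above. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content field.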