Gene Caul_3252 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3252 
Symbol 
ID5900707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3517697 
End bp3520093 
Gene Length2397 bp 
Protein Length798 aa 
Translation table11 
GC content66% 
IMG OID641563757 
ProductTonB-dependent receptor plug 
Protein accessionYP_001684877 
Protein GI167647214 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.970825 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGGTA GGAACGGCTT GCGTCGCGCG CTTTTGGCGT CGGCGTCCAT CATTCTTGCG 
GCGGGCGCCA TGGGATGGGG AACTGGCGCA ATCGCACAGG ACGGCGGAAC CAGCAACACC
AGGGCCGGCG CCGCGGCGTC GCAGGCCGAG ACCCAGGGGC CTCAGCTTGA GGAAATTGTC
GTCCAGGCCC GCCGGGTCGA GGAATTGCAG CAGGCGGTGC CGATTTCGAT TTCGACCGTC
ACGGCCAATC GACTGGCGGA CGCCGCCGCC AGTTCCGTGG CCGACATTCA ACGGATCGTG
CCCTCGCTGC AGGTCGGCCA GGACTCCTCA GGCCAACAGA ACTTCATCAT TCGCGGTTCG
TTCGGCGGCT TCGGCAATGA TCCGGCCGTC ATCACCTACA TCGACGAAGT GCCCACCGAG
TCGCGCACGC TGGTCTATTC CTTGTTCGAC CTGGGATCGG TCCAGGTGCT GAAGGGCAGC
CAGGGCACGC TTTTCGGCCG CAACAGCACC GGCGGCGCGG TGCTGTTCTT CAACCAGCGT
CCGCGCCTGG CGCAAACCGG CGGCTATCTC AGCGGTCGCT ACGGCAACAT GAACGAACGT
CGCCTCGAAG GTGCGGTCAA CCTGCCCATC GGCGACAATC TCGCGGTCCG GGTCTCTGGC
CAGGTCGAAC GTCGCGATGG CCTGCTCAAG AGCGTCACCG CGCCTGGCCT GGACTTTGGT
GACCGGCACA ATCAGGCCCT GCGCGCCTCG GCCCTGTGGC AGCCCAACGA CGCCATCGAG
AACTACACCC AGCTGACCCA CTATCGGCAG CGGGAAAACG CGCCGGCCCA GGTGCTCTAC
AGCCTGGCCG GCCCTTGCAC CGGTCCGACC ACGCCGGCCC CGGTCTGCCT CTACCAGCCG
CCGTTCAGCA GCTTCCTGGG GACGGGGAAC GTGAGGGCAT GGTTCGACCA GGAACAGGCC
CTGCCCGCCG GTCTGACGGT CAACAACAAC CCCAATCTCG ACTCTGTCGA TCGAGACTCG
GTGACCAACG CCTTCACCGC CGACCTGGGC AAGGTGAGCG TTCGCAACAT CGTTCACTAC
GGCGAAAGCA CCATCCGCTT CACCAAAGAC TATGACGGCA CGCCGGTGCG CATCTTCGAC
GCCGACCACC ACGACGAGAT GCGCAACTTT TACACCGAGA CGCAGCTCTT TGGCCGCGCC
CTCGGCGATC GCCTCAACTG GCGCGTGGGC GGCGTCTACA GTCACGATCG CGACGAACAG
GCCGCCACCA CGATCGTTTT CCCGCTGCCA GCGTCGATCA CCCAGCCGCG GACCGGCCTG
TCCGACCAAA CCAACAAGTC CTCGGCGGCC TTCGTCCAAG GCGCCCTGGA TCTATCGGAC
TGGCTGCGGG GCGTTTCGCT GACGGCCGGC TACCGTCACA CGTGGGACGA CCGCAAGCTC
GTTCAGCAGA TCCATACAGG CCAGCCAACC CCGGTCTGCG CGCTTCAGAC CCTGCCCGTC
CCCACGACCG GTCCGGTGCC ATTCCCGAGC ACGGACCTGG CGACCTGTAC GCGCCACCTG
AAACTGAAGG CCGACGATTC CAACTACAAC CTCACCCTCG ACTGGAAGCC GACCGACAAG
ATTCTGTTCT TCGTCTCCAG CCGCCGTGGC TACAAGGCCG GCAGCTTCAA CCTGCTGGCC
AATGATCCGG CCCTGGTCGC CTATGCGCCG GAGGTCGTCA ACGACCTGGA GGCGGGATTG
AAGGCCGACT GGAAGATCGG CGCTGTGCCG GTCCGCACCA ACGTTTCGGT GTTCCAGTCC
AAGTACACCA ACATCCAGGT GTCGACGGTC CGGGTCAATC CGGCCAATGG CGACATCAGC
GTGCTGATCC TCAATCAGGA CCCCGCGACG GGCCTGTCGA ACAAGGCCAC CGTCAAGGGG
TTCGAGGTGG AGGTGACGGC CGCGCCGACG CGCTGGCTGG AGCTGTCAGG CTTCTATTCG
AAGACCGACT CCACCTACGA CCGCTTCACC AATCCGGGCA CGACCCAGAA TCTCGCCGGC
CAGAAGGTCA GCGGCGTCAT ACCCAAGACC TACGGCGCGA CGGTTCAGGC GCACCTTCCC
CTCACCGGCG TCGCCGAAGA GATCGCCCTG ACGGCCAGCT ACTACGTGCG TGGCGCGCCT
CAGACCAACG TCACCTCGAC GTCGTTCGAG GACAAGCGAT CGTCCTTCGA CGCACGGCTG
GGCCTGCGCA ACCTGTTCGG CAGCGGCGCG GAGCTGGCGG TGTTCGGCAA GAACCTCGAC
GATGAGATCG CCTGCCCGAC CAACGCCGCC GTCACCGGGG CGCCGACCCG TCTGTGCGGC
GAGGGCCGCA CCTACGGCGT CGAGCTGACC TACCGGTTCG GGGGCGAGCG CCGATGA
 
Protein sequence
MSGRNGLRRA LLASASIILA AGAMGWGTGA IAQDGGTSNT RAGAAASQAE TQGPQLEEIV 
VQARRVEELQ QAVPISISTV TANRLADAAA SSVADIQRIV PSLQVGQDSS GQQNFIIRGS
FGGFGNDPAV ITYIDEVPTE SRTLVYSLFD LGSVQVLKGS QGTLFGRNST GGAVLFFNQR
PRLAQTGGYL SGRYGNMNER RLEGAVNLPI GDNLAVRVSG QVERRDGLLK SVTAPGLDFG
DRHNQALRAS ALWQPNDAIE NYTQLTHYRQ RENAPAQVLY SLAGPCTGPT TPAPVCLYQP
PFSSFLGTGN VRAWFDQEQA LPAGLTVNNN PNLDSVDRDS VTNAFTADLG KVSVRNIVHY
GESTIRFTKD YDGTPVRIFD ADHHDEMRNF YTETQLFGRA LGDRLNWRVG GVYSHDRDEQ
AATTIVFPLP ASITQPRTGL SDQTNKSSAA FVQGALDLSD WLRGVSLTAG YRHTWDDRKL
VQQIHTGQPT PVCALQTLPV PTTGPVPFPS TDLATCTRHL KLKADDSNYN LTLDWKPTDK
ILFFVSSRRG YKAGSFNLLA NDPALVAYAP EVVNDLEAGL KADWKIGAVP VRTNVSVFQS
KYTNIQVSTV RVNPANGDIS VLILNQDPAT GLSNKATVKG FEVEVTAAPT RWLELSGFYS
KTDSTYDRFT NPGTTQNLAG QKVSGVIPKT YGATVQAHLP LTGVAEEIAL TASYYVRGAP
QTNVTSTSFE DKRSSFDARL GLRNLFGSGA ELAVFGKNLD DEIACPTNAA VTGAPTRLCG
EGRTYGVELT YRFGGERR