Gene Caul_3738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3738 
Symbol 
ID5901200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4052590 
End bp4054578 
Gene Length1989 bp 
Protein Length662 aa 
Translation table11 
GC content70% 
IMG OID641564261 
ProductOPT family oligopeptide transporter 
Protein accessionYP_001685363 
Protein GI167647700 
COG category[S] Function unknown 
COG ID[COG1297] Predicted membrane protein 
TIGRFAM ID[TIGR00728] oligopeptide transporters, OPT superfamily
[TIGR00733] putative oligopeptide transporter, OPT family 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTCCG ACGCCGCCGC GGCTCCACGC TCTGAATTCA CGATCCGAGG GGTCCTTCTC 
GGTATCCTCA TCACCCTGGT CTTCACCGCC GCCCAGGTCT ATCTGGGCCT GAAGGTGGGC
CTGACCTTCG CCACCTCGAT CCCGGCGGCG GTCATCTCGA TGGCCCTGCT GCGGGCCTTC
AAGACCGCGA CGATCCAGGA AAACAACATC GTCCAGACCA TCGCGTCCGC GGCCGGGACG
CTATCGTCGG TGATCTTCGT GCTGCCCGGC CTGCTGATGA TCGGCTGGTG GAGCCACGTG
CCGTTCCTGA CCACCTTCGG CGCCTGCCTG GTCGGCGGGG TCCTTGGGGT GATGTACACC
ATCCCCCTGC GCCGGGCCCT GGTGACCAAT TCCGACCTGC CTTACCCCGA AGGCGTCGCG
GCCGCCGAGG TGCTGAAGGT CGGCACCGGC TCGCGGGCCG GCGCGGCCGA AGGTCAGGCG
GGCCTGATCG CGGTGGTGCT GGGCACCGTC GCCTCGGCCC TGTTCGCCGC CCTGGCCGCC
GCCAAGATCT TCGCCGCCGA GGTGGCGGGC TACTTCAAGG CCGGGGCCGG CGCGACCGGC
CTGGGCGCCT CCAGCTCGCT GGCCTTGATG GGCGCGGGCC ACCTGATGGG CATCACCGTT
GGCGTGGCGA TGTTCGCCGG CCTGGCCATC GCCTGGGGCG TGCTGGTGCC TGTGATGACC
AGCCTGCACC CGATGCCGGA CGTCTCGGTG GCCAAGCACG CCCTGACCGT CTGGAAGACC
GAGGTGCGGT TCATGGGGGC GGGCGTCATC GGCGCCGCCG CTATCTGGAC CCTGGGCAAG
CTGGCCCTGC CGATCTGGTC GGGCCTGATG TCGGCCTTCG AGGCCAGCAA GGCTCGCAAG
GTGGGCGGCG CGGTCATCCC TCGCACCGAG CAGGACATTC CGATCGGCAT CGTCGGCCTG
GTGTCGCTGC TGCTGCTGGC CCCGGCCGGC TGGTTCCTGG CCCACTTCCT GACCGGCGGA
CCGATCACCA GCCTGGCGAC GCCCCTGGTG GCTATCGGCG TCGGCTATGT GCTGATCGCC
GGCCTGCTGG CCGCCGCCGT CTGCGGCTAC ATGGCCGGCC TGATCGGTTC GTCCAACAGC
CCGGTCTCGG GCATCGCCAT CCTGACCGTG CTGGGCGCCT CGCTGATGGT CGGCGTCGTC
GGCCGCGGCG TGATCGGCCC CGACATCGCC AAGGCCCTGG TCGCCTACGC CCTCTATGTC
ACCACCATGG TGCTGGCCGT CGCCGTCGTG GCCAACGACA ACCTGCAGGA CCTGAAGACC
GGCCAACTGG TCGACGCCAC GCCGTGGAAG CAGCAGGTGG CCCTGATCAT CGGCGTCGTC
GCCGGCGCCA TCGTCATCCC CTTCGTGCTG GAACTGCTGA ACCAGTCGAA CGGCTTCTCG
GGCGCGGCCA ACCTGTCGAC CGTGGCCGGC GCCAAGCCGC TGGACGCCCC GCAGGCGACC
CTGATCTCCA CCCTGGCCAA GGGCGTGATC GGCCACGACC TGAACTGGAG CCTGCTGGGC
GTCGGCGCCC TGATCGGCCT GGGCCTGGTG CTGGTCGACA TCATCCTGCG CAAGTCGAGC
AACGGCCGCT TCAGCCTGCC GCCGCTGGGC GTGGGCCTGG CCATCTACCT GCCCAGCGCC
GTGACCGCCC CTGTCGTGGT CGGCGCCCTG GTCGGCTGGA TCTACGACAA GATGGTCGAC
AAGGACAGGA TGGGCGAGGC GGCCAAGCGC CTGGGCGTGC TGATCGCCTC GGGCTTCATC
GTCGGCGAGA GCCTGTTCAA CGTCGCCTTG GCCGGCCTGA TCGTCGTCAC CCAGAAGCCC
GGCCCGCTGG AGGTTCCCTT CGCGCCGTCC GAGCACGTGG GCATGATCTT CGCCCTGATC
GCCGCAGCGG TCGTGGTGGT GGGGCTGTAT GGCTGGGCTA GGCGGTCGGC GAACAAGATC
CAGGCCTGA
 
Protein sequence
MSSDAAAAPR SEFTIRGVLL GILITLVFTA AQVYLGLKVG LTFATSIPAA VISMALLRAF 
KTATIQENNI VQTIASAAGT LSSVIFVLPG LLMIGWWSHV PFLTTFGACL VGGVLGVMYT
IPLRRALVTN SDLPYPEGVA AAEVLKVGTG SRAGAAEGQA GLIAVVLGTV ASALFAALAA
AKIFAAEVAG YFKAGAGATG LGASSSLALM GAGHLMGITV GVAMFAGLAI AWGVLVPVMT
SLHPMPDVSV AKHALTVWKT EVRFMGAGVI GAAAIWTLGK LALPIWSGLM SAFEASKARK
VGGAVIPRTE QDIPIGIVGL VSLLLLAPAG WFLAHFLTGG PITSLATPLV AIGVGYVLIA
GLLAAAVCGY MAGLIGSSNS PVSGIAILTV LGASLMVGVV GRGVIGPDIA KALVAYALYV
TTMVLAVAVV ANDNLQDLKT GQLVDATPWK QQVALIIGVV AGAIVIPFVL ELLNQSNGFS
GAANLSTVAG AKPLDAPQAT LISTLAKGVI GHDLNWSLLG VGALIGLGLV LVDIILRKSS
NGRFSLPPLG VGLAIYLPSA VTAPVVVGAL VGWIYDKMVD KDRMGEAAKR LGVLIASGFI
VGESLFNVAL AGLIVVTQKP GPLEVPFAPS EHVGMIFALI AAAVVVVGLY GWARRSANKI
QA