Gene Meso_4084 details


Gene Information

Locus tag: Meso_4084
Symbol:
ID: 4182784
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chelativorans sp. BNC1
Kingdom: Bacteria
Replicon accession: NC_008254
Strand:
Start bp: 4394796
End bp: 4396664
Gene Length: 1869 bp
Protein Length: 622 aa
Translation table: 11
GC content: 59%
IMG OID: 638069980
Product: extracellular solute-binding protein
Protein accession: YP_676616
Protein GI: 110636408
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
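
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the protein length excludes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4394796, 4396664)
print(length)                     # 1869
print(protein_length_aa(length))  # 622
```

Both results agree with the Gene Length and Protein Length fields of the record.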


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGGCTA AGGTCATCGA ACAGGGGAAA AAGCTAACCC TTGTCGGTCT TGGGATGTTC 
TTACTGGTCG CGAACGCAAG CGCGCAGGAG TGGCGCACCA CAAGTTCGCT GGTCAATCCT
GAGGCGGAGA CCAAGCCCTT TGAACGCTAC AGCTATGTAA ATCCAGAAGC ACCGAAGGGC
GGCACTCTGA ACTCCGCCGT CTTCGGCACA TTCGACAGCT TCAATCCCTT CATTGTGCGC
GGCACGCCGG CGGCGGGCCT CACCTATTTC GGCGGCATGT TGTGGGAAAC GCTCATGCAA
CAGTCGCCCG AAGATCCGGG CACCAGCCAT CCGTTGATCG CCGAGGCGTT CAAATACCCG
GAGGATTATT CCTCGGCCAC CTACCGTCTC AATCCGAACG CCCGCTGGCA TGACGGCAAA
CCTGTCACCG TGGAGGACGT GATCTGGTCG TTCAACATGC TCAAGGAGAT CAGTCCGCAG
CACAACCGCT ATTTCGCGAA TGTCGAGGAG GCCGTCGCGC TCTCCGACAC AGAGGTCGAA
TTCCGCTTCG ACCAGGGCGG AAACCGCGAG CTGCCGCATA TCATGGGCGA CCTTCCGGTC
CTTCCCAAGC ACTGGTGGGA GGGGACGGAT TCCCAAGGCA GGCAGCGTAA TCTCAGAAAT
CCCACCCTGG AGCCGCCGCT CGGCAGCGGA CCCTACAAAA TCGCGAGCTT CCGCCCCGGC
TCGGAAATCA TATGGGAGCG CGCCGAAGAT TATTGGGCTG CGAATCTGCC GGTCAATATC
GGCCGGTACA ATTTCGATCG CATCCGATAC ACGTATTTCC AGGACGACAA TGCCGAGTTC
CTGGCGTTCC AGAAAGGCGG CATCGAGGAT GTACGGCGCG AGCTGAGCAC GCGGCGCTGG
TCACAGGAAT ATGACTTTCC CGCGGTGCAG GATGGCGACG TCATCAAGCG TGAATTTACC
AGCACGGCCA TCGAGGGGAT GCAGGCCTTC GTCTTCAATA TGCGGAGACC TCGGTTCCAG
GACAGCCGCG TGCGCGAGGC TCTTACGCTG GCATACAATT TCGAGGAACA GAACAGAACG
CAGTTCTTCG GGCTCAACAA GCGCTTCAGC AGCTATTTCG AGCGCTCGGA GCTGGCATCG
AGCGGTCTGC CCCAAGGGCA GGAGCTGGAA ATCCTGGAGG AATTCCGCGA TCAACTTCCG
CCGGAAGTTT TTACCGAAGA GTTCAAGCTG CCCGTCTATG ATTCGCCGCA GTCCGAACGG
CAATATCTGC GCGAGGCGGT TCGCCTCTTC AACGAGGCCG GGTGGGAAAT CCAAAGCGGC
CGGATGATCA GCAAAGAGAC AGGCGAGCAA TTCCGCATCG AGTTTCTTGG AGCATCGCCG
ACCGCCGAGG TCATCACCGG CGGCTTCATG GCCAATCTAC GGAAGATCGG AATCAATGCG
ACGCTGCGCA TCGTCGACAC GTCGCAATAT ATACAGCGTG TTCAGAACTT CGAATTCGAT
GCCATCACAG CCCGCTTCCC CCAGTCCAAC TCTCCGGGCA ACGAGCAGCG GGATTACTGG
AGTTCGGAGG CCGCCGACAT CCCGGGTTCG CAAAACGTGA TCGGCATCAA GGATCCGGTG
GTGGACGCCT TGGTGAACAA GATCATCTAC GCCAAGAACC GCGAGGAACT CGTCGCGACG
GTTAGGGCGC TCGATCGCGT GCTCCTCTGG AAGTACTACG CGATCCCGCA ATACTACCAG
CCCACCCTTC GCTATGCCTA CTGGAACAAA TTCGGCATAC CGGAAAAGCA GCCGGGCTAT
GCGGGCGTGG ATGTCGATTC CTGGTGGGTC GATCCCGAGC TCGAGGCGGC GCTCGAGGCG
AAGTACTAG
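
The gene sequence is printed in blocks of 10 bases, so it has to be joined before it can be measured. The sketch below shows one way to strip the block spacing and compute a GC percentage comparable to the GC content field (reported as 59% in this record). The helper names are hypothetical, and the short input is only the first row of the sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, as printed in the record.
first_row = "ATGCTGGCTA AGGTCATCGA ACAGGGGAAA AAGCTAACCC TTGTCGGTCT TGGGATGTTC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full text of the Gene sequence block through `clean_sequence` should give a string whose length equals the Gene Length field.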
 
Protein sequence
MLAKVIEQGK KLTLVGLGMF LLVANASAQE WRTTSSLVNP EAETKPFERY SYVNPEAPKG 
GTLNSAVFGT FDSFNPFIVR GTPAAGLTYF GGMLWETLMQ QSPEDPGTSH PLIAEAFKYP
EDYSSATYRL NPNARWHDGK PVTVEDVIWS FNMLKEISPQ HNRYFANVEE AVALSDTEVE
FRFDQGGNRE LPHIMGDLPV LPKHWWEGTD SQGRQRNLRN PTLEPPLGSG PYKIASFRPG
SEIIWERAED YWAANLPVNI GRYNFDRIRY TYFQDDNAEF LAFQKGGIED VRRELSTRRW
SQEYDFPAVQ DGDVIKREFT STAIEGMQAF VFNMRRPRFQ DSRVREALTL AYNFEEQNRT
QFFGLNKRFS SYFERSELAS SGLPQGQELE ILEEFRDQLP PEVFTEEFKL PVYDSPQSER
QYLREAVRLF NEAGWEIQSG RMISKETGEQ FRIEFLGASP TAEVITGGFM ANLRKIGINA
TLRIVDTSQY IQRVQNFEFD AITARFPQSN SPGNEQRDYW SSEAADIPGS QNVIGIKDPV
VDALVNKIIY AKNREELVAT VRALDRVLLW KYYAIPQYYQ PTLRYAYWNK FGIPEKQPGY
AGVDVDSWWV DPELEAALEA KY
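
The protein sequence follows from the gene sequence through translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below is a plain-Python illustration that assumes the standard codon assignments, which table 11 shares with table 1 for internal codons; it does not handle the alternative start codons that table 11 permits, which is not an issue here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a cleaned DNA string codon by codon, stopping at the first stop."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGCTGGCTAAGGTCATCGAACAGGGGAAA"))  # MLAKVIEQGK
print(translate("AAGTACTAG"))                       # KY
```

The two outputs match the first block and the final two residues of the protein sequence listed above.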