Gene Meso_1950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMeso_1950 
Symbol 
ID4182517 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChelativorans sp. BNC1 
KingdomBacteria 
Replicon accessionNC_008254 
Strand
Start bp2090851 
End bp2092371 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content65% 
IMG OID638067846 
Productextracellular solute-binding protein 
Protein accessionYP_674508 
Protein GI110634300 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.412794 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCAACC TGTCTAAATC CGGCCTGGCC TGGACCTTGC TGGCTGCGCA GGCCGGACTT 
GCCTTCGCGG CAGCACCGGC CCTCACCGCC GAAGGGTCCA TCGTCCTGGC GATGCCGGCC
GAGCCGACCT CGGTCGATGC CTGCGATGAC AGCACTAGGG CCAATGCCCG GGTGCTGCGC
GGCAATGTCG TCGAGGCGCT GACGCGGCTC GACCCACAGT CGGGCGCGGT CGGCCCGCTG
CTCGCCACCG AATGGTCTAG TCCGGATAAC AAGAGCTGGC TCTTCACCAT CCGGCCGGGC
GTCACCTTCC ACGACGGCAC GCCGCTGGAT GCGGCCGCGG TGGCGTTCGG CATCAACCGC
TCGATGAACC CGGACCTGAC GTGCCAGACC CTGTCGCTGT TCCCCACCAA GACCACCGCG
ACCGTCGAAA GCGACATGGT GGTACGGATC ACCACCGAAG AGCCCGACCC GATCCTGCCG
GCACGCATCG CCTATATCGA TCTGCCCTCC CCCAAGACTC CGGAAGCCGC CAAAAGTGAC
ACCCCGATCG GCACCGGTCC CTACCGGTTT GCCGGCCGCG AGATCGGCCA GTCGATAACG
CTATCGGCGT TTGACGGCTA TTGGGGCGAA GCCCCCGAGA TCGCCGAAGC CAACTATGTC
TGGCGGGCCG AGGCCACCAT CCGCGCCAGC ATGATCAAGA CCGGCGAAGC CGATATCGCC
TATGATATCC CCACCCATGA GGCCGAAGGC CAGGCCAATG CCCAGCAATA TTTGACCAAC
GGCGTGTTCT ACCTGCGCCC GATGCTGCAG AAGCCGCCGC TGGACGATCT GCGCGTGCGC
CAGGCCATCG CCTCCTCGAT AGACAAGGCC ACGCTGGCCG AAGTGCTGAT GGACAATTCG
GGCACGCCGA CCGGGCAATT GGTCACACAG CTGATCAATG GCTACGTGCC CGATTATACC
GGCATGCCCT ATGATCTCGA AAAGGCCAAG GCACTGTTCG CGGAAGCCAA GGCGGCCGGC
GTCGCCGTCG ATACGCCGAT CACGCTGGTG GCCCGCACCG ACCTGTTCAG CGGCGCCGAG
GAAGTCTCGC AGGCGATCCA GCAGATGATC CAGCAGGCCG GCTTCACCGT CACGCTGAAA
TCGGTGGATA CCGTCGGCTG GAGCCCCTGG GCTCGCAAGC CGGACTCGCT TACCCAGCCG
GTGAACCTGC TCACCTCGAG CCACAACAAC ATTTCGGGCG ACGGTTCGCT GACCTTCCCG
AACTTCCTGG GCAGCGGCGG CCGGCTGAGC GTGGTCGACA ATGCCGAACT CGATGCCAAG
CTGGCGGCCG CGGCCAAGGC CAGCGGCGAA GACCGGGCCG CGGCATATCG CGAGATCGCG
CAATATGCCT ATGATCAGGA ACTGGTCATT CCGGTTGCGG CCCTGCAGGG GCTGCTGCTG
ACCTCGGATC GCATTGCCTA CGAGGCGGAT GGCTTTACCG ACATCGAACT GCATCTGTCC
GACGTCAAGC ACAAGCAGTA G
 
Protein sequence
MINLSKSGLA WTLLAAQAGL AFAAAPALTA EGSIVLAMPA EPTSVDACDD STRANARVLR 
GNVVEALTRL DPQSGAVGPL LATEWSSPDN KSWLFTIRPG VTFHDGTPLD AAAVAFGINR
SMNPDLTCQT LSLFPTKTTA TVESDMVVRI TTEEPDPILP ARIAYIDLPS PKTPEAAKSD
TPIGTGPYRF AGREIGQSIT LSAFDGYWGE APEIAEANYV WRAEATIRAS MIKTGEADIA
YDIPTHEAEG QANAQQYLTN GVFYLRPMLQ KPPLDDLRVR QAIASSIDKA TLAEVLMDNS
GTPTGQLVTQ LINGYVPDYT GMPYDLEKAK ALFAEAKAAG VAVDTPITLV ARTDLFSGAE
EVSQAIQQMI QQAGFTVTLK SVDTVGWSPW ARKPDSLTQP VNLLTSSHNN ISGDGSLTFP
NFLGSGGRLS VVDNAELDAK LAAAAKASGE DRAAAYREIA QYAYDQELVI PVAALQGLLL
TSDRIAYEAD GFTDIELHLS DVKHKQ