Gene Meso_4520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMeso_4520 
Symbol 
ID4178444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChelativorans sp. BNC1 
KingdomBacteria 
Replicon accessionNC_008243 
Strand
Start bp83649 
End bp84896 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content61% 
IMG OID638059407 
Productextracellular solute-binding protein 
Protein accessionYP_666129 
Protein GI110347312 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCGAC TATCGAAATT CTTAGGGCTC AGCATGCTGA GCATTGCCAT GACGCTGCCG 
GCCGTTGCCG CCTCCGCCGA AGAGATCACC TGGTGGGCCC CGAACTGGGG TGAGGCGCGC
GCCCGGAAAC TGGTCGAGGA CTTCCAGGCC GCCAACCCGG ATGTCACGGT TAACCTGGAG
ATCACCGTTT CGAACGGCCT CCAGAGCCGC ATCGAGGTCG CCCTGCGCTC GGGAAACCCG
CCGGACCTGA TCGACACCAG CATGCAGTGG GTCATCCCGT TCGCCAGCAG CGGCAAGCTG
CTCGATCTCG ACCAGTTCGC CAAGGAACAG GTTAACCTCG ACGACCTTCT GCCGGCCACA
CTGGATTCCA CCCGCTACAA CGGCCACATC TACGGCCTGC CTTATCGGGC CCAGACCCTG
GCCTTGATCT ACAACAAGGC GCTTTATCGC GACGCCGGTC TCGACCCGGA CAATCCGCCG
AAGACCTGGG ACGAATTCAT CAAGGCCTCG CAGGCGCTGA CGAAGACCAA TACGGCCGGC
AAGCAGCAGT ATGGCATCGG CGTTGCCGGC GGCGGCGAAT TGGGCAATCT GATCACCCGC
ATGGTTCCGT TCATTTGGAT GAACGGCGGC GATGTTCTCA ATGCCGATTT CACCGAGGCG
ATCGTCAACG AGAAGCCGGC GGTCGAAGCC GTCGAGTTCT ACACGGCACC GCTGACCAAG
TACAACATTG CACCGCCTTC GACGCTGCAG AACGACGGCC TTGCACTGCG CAGGCTTTTC
GGCGCCGGAA CCGTCGCGCA ATATTTCTCC GGACAGTTCG ACCTTCCCGC CATCAAGCAG
GAAGCGCCTG ACCTGGAGAT CGGCATCGCT CCGTTCCCGC ATCCGGAAGG CAAGCAGACT
GCGGGTATCC TGAGCGGCTG GGCTTTCGTA GTGCCGGCCG ATTCGCAACA TAAGGACGCC
GCACTCCGTT TGGCGAAATT CCTCATGCTG CCGGAAAACC AGGGCTATTA CACCGATACC
TTCCCGGCCA GTATGAGTGC CATGGACCTG CCAAGGTTTA AGGACCCGCT TCTGCAGCCC
TTCAAGGAGA TGCTGAAGTT CACCAAGCCC GCGCCTTCGA CACCCGTCTG GATCAAGGCT
CAGCAGATCC TCTTTGCCCA CACCCAGGAA GTCCTGCTCA ACTCCGCAAC GGCGCAGGAA
GCCATGGATG CTGCGGCCGA AGAGATAAAC GACGCGCTCG CCCGCTGA
 
Protein sequence
MKRLSKFLGL SMLSIAMTLP AVAASAEEIT WWAPNWGEAR ARKLVEDFQA ANPDVTVNLE 
ITVSNGLQSR IEVALRSGNP PDLIDTSMQW VIPFASSGKL LDLDQFAKEQ VNLDDLLPAT
LDSTRYNGHI YGLPYRAQTL ALIYNKALYR DAGLDPDNPP KTWDEFIKAS QALTKTNTAG
KQQYGIGVAG GGELGNLITR MVPFIWMNGG DVLNADFTEA IVNEKPAVEA VEFYTAPLTK
YNIAPPSTLQ NDGLALRRLF GAGTVAQYFS GQFDLPAIKQ EAPDLEIGIA PFPHPEGKQT
AGILSGWAFV VPADSQHKDA ALRLAKFLML PENQGYYTDT FPASMSAMDL PRFKDPLLQP
FKEMLKFTKP APSTPVWIKA QQILFAHTQE VLLNSATAQE AMDAAAEEIN DALAR