Gene Mflv_3775 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_3775 
Symbol 
ID4975091 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp4030044 
End bp4031369 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content67% 
IMG OID640457999 
Productextracellular solute-binding protein 
Protein accessionYP_001135035 
Protein GI145224357 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.577316 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.394592 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATTCG CCCAAATTCC CACTGCCCGG CGGGGCGGGA AGACTGCCCG CTCCGCCGTG 
CTCGCTCTGT TGGCCGTCCT TGCGCTGGTG CTCTCCGCGT GCGCCGGCAG CGGCGGACCC
GAGCAGGCCG AGAGCACCGG AAGCGGCGAG GTCTCCGCCG ACACCTCGGG CACCGTGCGG
ATCCTGATGG AGAACGTGCC CGACACCGAC ATCGTCAAGT CGATGGTGGC CGACTTCAAC
GCCGAATACC CGGGCGTCGA GATCAACATC GAGTCGCTCA CGTTCGACCA GATGCGCGAC
AAACTGGTGT CCTCGTTCCA GTCCTCGTCG CCGGCCTACG ACCTGATCGT CGTCGACAAC
CCCTGGATGG TCGACTTCGC CAACGCGAAG TTCCTGCAGC CACTCGATGC CCGCATCGAC
AGCACCCCCG ACTACGACGC CGGCGACTTC TTCACGCCGC TCACCGACAT CACCACCGTC
GACGGCACCC GCTACGGCGT TCCGTTCTAC AACTACGCGC TCGGATACCT GTACAACGCC
GACGATCTCG CCGCCGCCAA CCAGCAGGTG CCGACCACGC TCGACGAACT CGTCAGCACC
AGCAAGGCGC TCAAGAGCGG TGACCGCGCC GGCATCGCGA TGCAGCCGCA GCGCGGCTAC
AAGATCTTCG AGGAGTGGGG CAACTGGCTC TTCGCCGCGG GCGGGTCGAT CTACGACGCC
GACGGCAAGA TCACGCTGAA CACCCCTGAG GCCAAGCGGG CGCTCGAGGC CTACATCGAC
ACCTACAACA CCGCCGCACC CGCCAACAGC CTCAGCTGGG GCATGGACGA GGCGCAGCGC
TCGGTGTCGG CGAACCAGTC CGCGTCGATG ATCAACTACA ACTGGCAGCT GCCCGCCCTC
AACGAGCCGG GCTCCGGACC TGCCGCCGGC AAGATCAAGC TCGCCACCAT CCCCGGCGGC
AAGCAGGTGC TGGGCTCCTG GAGCTGGGCG ATCCCGGCCA ACTCCGCGAC TCCCGACGCG
GCATGGGCGT TCGTCTCGTG GATCACCGCC AAGCCCAACG ATGTCGTGCG CACCGAGAAG
GGCGGCGCGG CGATCCGCAA GAGCACGCTG CAGAATCCCG CTGTGCTGCA GGGCCAGTTC
GGCGAGGAGT ACTACCGGAC CGTCGAGCAG CTGCTCGCCG ATGCGGCGCC CCTGACCCAG
GGCCCCAGCG GCGAGGAGAT GATCCAGGCC GTCGGCACCG CGCTCAACGA AGCGGTCGCC
GGTGAGAAGA GCGTCGACGA CGCCCTTGCC ACCGCACAGG CCGAAGCCGA GAAGATCCAG
GGCTAG
 
Protein sequence
MRFAQIPTAR RGGKTARSAV LALLAVLALV LSACAGSGGP EQAESTGSGE VSADTSGTVR 
ILMENVPDTD IVKSMVADFN AEYPGVEINI ESLTFDQMRD KLVSSFQSSS PAYDLIVVDN
PWMVDFANAK FLQPLDARID STPDYDAGDF FTPLTDITTV DGTRYGVPFY NYALGYLYNA
DDLAAANQQV PTTLDELVST SKALKSGDRA GIAMQPQRGY KIFEEWGNWL FAAGGSIYDA
DGKITLNTPE AKRALEAYID TYNTAAPANS LSWGMDEAQR SVSANQSASM INYNWQLPAL
NEPGSGPAAG KIKLATIPGG KQVLGSWSWA IPANSATPDA AWAFVSWITA KPNDVVRTEK
GGAAIRKSTL QNPAVLQGQF GEEYYRTVEQ LLADAAPLTQ GPSGEEMIQA VGTALNEAVA
GEKSVDDALA TAQAEAEKIQ G