Gene Mflv_0119 details

Gene Information

Locus tag: Mflv_0119
Symbol:
ID: 4971741
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium gilvum PYR-GCK
Kingdom: Bacteria
Replicon accession: NC_009338
Strand:
Start bp: 107183
End bp: 108781
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 69%
IMG OID: 640454324
Product: extracellular solute-binding protein
Protein accession: YP_001131402
Protein GI: 145220724
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
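
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length(107183, 108781)
print(length, protein_length(length))  # prints: 1599 532
```

Both results agree with the Gene Length (1599 bp) and Protein Length (532 aa) fields.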


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.135782
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGTTCAG ATAGACACAT GAACTTCCGC GTCGCGACGG TCCTCGCGTC TGTCCTGGTC 
GCCGTGACGG CGTGCTCGTC CTCCTCCCCG GCCGAGCCCC GCGACCAGAT CGTGCTCGCG
GAAGGATATG AGCTCGGCGG CTTCAACCCG GTCAACGGCT ACGCGGAATC CGGGGTCTCA
CCGATCTACG ACGGTCTCTA CCGGCCGAGC GCAACCACCG ATGCCGTCAT TCCGGAATTG
GTCCCTGCGC TCGCCGCGCA GGAACCGCAG CCGGCCGGGC CGAACCGCTG GCGTGTCCCC
CTGCGCCCCG GCGTCGTGTT CTCCGATGGA ACCACCTTCG ACCCCGTCGA TGTGGTGGCG
ACCTACGACG CGGTCCGGGA TCCCCGGGTG GCGTCGGAGA TCTCGACCTC GGTGGCACCC
ATCGTGTCGA TCGAGGCCGA CGGCCCCGAC GCCGTCGTCG TCGAGCTCGA CACCGCGGCC
GACCCGAAAC CCTATCTGCT GCTGGGCATC CTGCCCTCGG AGAGGGTCGA GGCGACACCG
GCCGCAGATT GGGCCGTCAA CACCGCCCCC GTCGGCACCG GCCCCTACCG ACTCGACAGT
CTGCGTCCCG ATCAGGCCGT CCTGGTCGCG CGCGACGACT ACTGGGGTGA CGCCCCGCAG
GTCCGACGGC TCGTCTACAC CTATGCGCCC GACGACAACG CGCGGGCGCA GAGCATGGTC
TCCGGGGCCG TCGACGGGAC GAATCTTCCA CCGCGGCTCA TCGATTCGGT CAAAGGCTCG
AACGATGACG TGCAGACCGT CGCCGTGCGC TCCGCGGACT GGCGCGGCAT CGCGCTGCCC
GCGGACAATC CGTTCACCGC CGACGTGACC GCCCGGCTGG CGATGAACGT CGGCATCGAC
CGCGACGCGC TGGTCCGCGA TGTCCTCGTC GGTTACGGCA GCCCGGCAGC CACACCGGTG
GCCGACGCGT ACGGCGACGC TTACGACCCC GGGGCGCAGT ACGTGTTCGA CCTCGACCGG
GCCCGGAACC TGCTCGACGA TGCGGGATGG CGCCCCGGCG CCGGACAGAT CCGCGAAAAA
GACGGTGCAC GAGCATCATT CGAGCTGCTG TACAACGCGC AGGACACGCT GCGACGCGAT
CTGGCGGTCG CCTACGCCGC GGCGATGAAG CCGCTCGGCG TCGACGTCCG CCCCCGCGGC
ACCAGCTGGG ACGAGATCGA CACCCGGTTC GCCGATTCCG CGGTGGTGCT CGGCGGCGGC
GCGACGCCCT ACAGCATCGA CTCCCAGGTC TATGACACCC TGCACACGCG GGTGCCCGAC
TCGTCGCCGT ACTCCAACCC GGGTAACTTC ACCGCGCCGG GACTGGACGC GCTGCTCGAA
CAGGCCGCAC AGTCTCCGTC CGGCCCCGCC AAGGACGCGC TGTACCGCGA GATCCAGGCC
AGGTACGCGG CCGCGCCGTC GCATGTGTTC CTGGTGTTCC TGCACCACAC CTACAGCTAC
CGGGATCTCG GCTGGCAGCA GAGCGCGCCG ATCATGGAGC CCCACTCGCA CGGGGTCTCG
TGGGGACCCT GGTGGAATCT CGCGGCGTGG ACACGCTGA
 
Protein sequence
MCSDRHMNFR VATVLASVLV AVTACSSSSP AEPRDQIVLA EGYELGGFNP VNGYAESGVS 
PIYDGLYRPS ATTDAVIPEL VPALAAQEPQ PAGPNRWRVP LRPGVVFSDG TTFDPVDVVA
TYDAVRDPRV ASEISTSVAP IVSIEADGPD AVVVELDTAA DPKPYLLLGI LPSERVEATP
AADWAVNTAP VGTGPYRLDS LRPDQAVLVA RDDYWGDAPQ VRRLVYTYAP DDNARAQSMV
SGAVDGTNLP PRLIDSVKGS NDDVQTVAVR SADWRGIALP ADNPFTADVT ARLAMNVGID
RDALVRDVLV GYGSPAATPV ADAYGDAYDP GAQYVFDLDR ARNLLDDAGW RPGAGQIREK
DGARASFELL YNAQDTLRRD LAVAYAAAMK PLGVDVRPRG TSWDEIDTRF ADSAVVLGGG
ATPYSIDSQV YDTLHTRVPD SSPYSNPGNF TAPGLDALLE QAAQSPSGPA KDALYREIQA
RYAAAPSHVF LVFLHHTYSY RDLGWQQSAP IMEPHSHGVS WGPWWNLAAW TR
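
The protein sequence is the translation of the gene sequence under translation table 11. As a rough illustration, the sketch below translates the first row of the gene sequence and computes a GC fraction. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the code is an independent sketch, not the method used to generate this record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence, copied from the record.
first_row = "ATGTGTTCAG ATAGACACAT GAACTTCCGC GTCGCGACGG TCCTCGCGTC TGTCCTGGTC"
print(translate(first_row))  # prints: MCSDRHMNFRVATVLASVLV
```

The 20 residues printed match the start of the protein sequence (MCSDRHMNFR VATVLASVLV). Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.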