Gene Mflv_3814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_3814 
Symbol 
ID4975130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp4074445 
End bp4076091 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content70% 
IMG OID640458038 
Productextracellular solute-binding protein 
Protein accessionYP_001135074 
Protein GI145224396 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.028242 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.618812 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGGCC GTTGGCGTGC GCTGACCGCG GTGCTCGCGG TGGTGGGCGG ACTCGGGCTC 
ACCTCCTGCG GCGAGGCCAC CGCCGACTCC GTCGACTACG CCGTGGACGG AGTGCTGACC
AGCTACAACA CCAACACGGT CGTGGGCGCC GCCTCGGCAG GGCCGCAGGC GTTCGCGCGG
GTGCTCACCG GCTTCAGCTA CCACGGGCCC GAGGGCCAGA TCGTCGGTGA CCACGACTTC
GGCACGATCT CGGTGGTCGG GCGGACCCCG CTGATCCTCG ACTATGAGAT CAAGCCGGAA
GCGGTCTACT CCGACGGCAA ACCCATCACC TGCGATGACA TGGTGCTCGC GTGGGCGTCG
CAGTCCGGCC GGTTCCCGGC GTTCGACGCG GCCAGCCGTT CGGGCTACGC CGACATCGCC
GCGATCGAAT GCGCGCCGGG ACAGAAGAAG GCCCGGGTGT CGTTCGCGCC GGAGCGGGCG
TTCACCGACT ACGGGCAGTT GTTCACGGCG ACGTCGATGA TGCCGTCCCA TGTCGTCGGC
GACGTGCTCG GGCTCGGAGA CGGTGCCGTC ACGACCGCGC TGCTGAACAA CGACATCCCG
GCCGCCGAGC GCATCGCGCA GGTGTGGAAC ACGACCTGGA ACCTCGGTCC GGATCTGGAC
CTCAAGAAGT TCCCGTCGTC GGGCCCCTAC AAGCTCGACT CCGTCACCGC CGACGGCGCC
GTGGTGCTGG TCGCCAACGA CAAGTGGTGG GGCGCCAAGC CGGTCACCGA CCGTGTCACC
GTCTGGCCCC GCAGCCCCGA CATCCAGGAC CGCGTCAACG AGGGCGCCTA CGACGTCGTC
GACATCGCCG CCGGCTCCTC GGGCACCCTG AACGTGCCCG ACGACTACGT CCGCACGGAC
GCCCCGTCGG CGGGCATCGA GCAGCTGATC TTCGCGCCCG AGGGAGCGTT GGCGGCCGTG
CCCGCCCGGC GCGCGCTGGC GTTCTGCACC CCGCGCGACG TGATCGCCCG CAACGCCGAG
GTTCCCGTGG TCAACGCCCG CCTCACCACC GCCACCGAGG ACGCCATCGG GTCGGCCGAG
CTGACGCCGC AGATCAACGA GTTCGCGGTG GCCAATCCCG ACGCTGCGCG CCAGGCGCTC
GGCAACACAC CGCTGACCGT GCGCATCGGC TATCAGACGC CGAACGCCCG GCTGGCGGCG
ACGGTCGGCA CCATCGCGAA GGCCTGCGCC CCGGCCGGTA TCACCGTGCA GGACGCGGCG
AATGCCGACA CCGGGCCGAC GGCGTTGCGG GACAATCAGA TCGACGTGTT GATCGCGAGC
ACCGGGGGAG CGGCGGGCAG CGGGTCGTCG GGCTCGTCGG CGGTGGACGC CTACACGCTG
CACAGCGGCA ACGGCAACAA CCTGCCCCGC TACGCCAACG GGCGCATCGA CGCGATCATC
TCCACGCTGG CGGTGACCAC CGACCCCAAG GAGTTCGCCC GGCTCCTCGG CGAGGCCGGC
CCGATCCTCT GGGCGGATAT GCCGACGCTG CCGCTGTACC GTCAGCAGCG GACGCTGCTC
ACCTCGACCA AGATGTCCGC GGTGATCGGC AACCCGACAC GATGGGGAGC GGGCTGGAAC
ATGGACCGTT GGAAGCTCAG CCGGTGA
 
Protein sequence
MAGRWRALTA VLAVVGGLGL TSCGEATADS VDYAVDGVLT SYNTNTVVGA ASAGPQAFAR 
VLTGFSYHGP EGQIVGDHDF GTISVVGRTP LILDYEIKPE AVYSDGKPIT CDDMVLAWAS
QSGRFPAFDA ASRSGYADIA AIECAPGQKK ARVSFAPERA FTDYGQLFTA TSMMPSHVVG
DVLGLGDGAV TTALLNNDIP AAERIAQVWN TTWNLGPDLD LKKFPSSGPY KLDSVTADGA
VVLVANDKWW GAKPVTDRVT VWPRSPDIQD RVNEGAYDVV DIAAGSSGTL NVPDDYVRTD
APSAGIEQLI FAPEGALAAV PARRALAFCT PRDVIARNAE VPVVNARLTT ATEDAIGSAE
LTPQINEFAV ANPDAARQAL GNTPLTVRIG YQTPNARLAA TVGTIAKACA PAGITVQDAA
NADTGPTALR DNQIDVLIAS TGGAAGSGSS GSSAVDAYTL HSGNGNNLPR YANGRIDAII
STLAVTTDPK EFARLLGEAG PILWADMPTL PLYRQQRTLL TSTKMSAVIG NPTRWGAGWN
MDRWKLSR