Gene Mflv_5163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_5163 
Symbol 
ID4976474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp5503250 
End bp5504842 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content71% 
IMG OID640459393 
Productprotein of unknown function DUF894, DitE 
Protein accessionYP_001136417 
Protein GI145225739 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR01195] sodium pump decarboxylases, gamma subunit 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0799644 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATCGC CGGTATCGAC CTCCACCTGG GCGCCGCTGC AGTCGCCGGT GTTCCGCGCG 
CTGTGGATCG CGCAGTTCGT CTCGAATCTC GGCACCTGGA TGCAGACCGT CGGCGCCCAG
TGGATGCTGG TCGACGACCC GGCCGCCGCG GTTCTGGTGC CGTTGGTGCA GACGGCCACC
ACGTTGCCGG TGATGCTGCT GGCGTTGCCG TCGGGTGTGC TGGCCGATCT CGTCGACCGC
CGCCGGTTGC TGATCGCGAC GCAGGGCGCG ATGGCCGCCG GCGTGGGTCT TCTCGCCACG
CTGACCGGTG CCGGGCTGAC CACCCCGACG GTGCTGCTGA CGCTGTTGTT CGTGATCGGC
TGCGGGCAGG CGCTGACCGC GCCGGCCTGG CAGGCGATCC AGCCGGATCT CGTGCCGTCG
GATCAGATTC CCGCCGCTGC GGCGCTCGGC AGCATGAGCA TGAACGGGGC CAGGGCGATC
GGACCCGCGA TCGCCGGTGC TCTGGTGTCG CTGACCGGCC CGACGATCGT GTTCGCACTG
AACGCGGTGT CGTTCGTGGG CATCGTGCTG GTGCTGATGT GGTGGCGGCG TCCGCCCGTG
CGCAGCGACT ACCCGCCCGA GCGGGCGCTG GCCGCGCTCA GTGCCGGCGG CCGCTACATC
CGCAGCTCGC CCATCGTGCG GCGCATCCTG CTGCGCACGG TGTTGTTCAT CGCGCCCGGC
AGCGCGGTGT GGGGGTTGCT GCCGGTCATC GCGAGGGACC AGCTGGGCCT GGGCTCGGCC
GGCTACGGCG CGCTGCTCGG CGCACTGGGT GTCGGGGCGG TGCTCGGCGC GTTCGCGTTG
TCGCGGCTGC GGGCCCGGTT CGGTCAGAAC AAGCTGCTGA TCGCCGGGGC GGCCGGCTTC
GGGATGGCCA CGGCCGTGCT CGCGCTCGTC CACAACGTCG CGCTCGTCGC GGCGGCATTG
GTGGTCGGGG GCGCCGCGTG GCTGCTGACG CTGTCGACGC TGAACGCGTC GATGCAGCTG
AGCCTGCCGA GTTGGGTGCG GGCCCGCGGC CTGTCGGTGT ACCAGCTGAT CTTCATGGGC
GGTCAGGCCG TCGGTTCGCT GCTGTGGGGG CTGCTCGCGG GCGGCACCAG CAGTGTCATC
AGCCTGCTGG TCAGCGCCGG GTTGCTGATC TTCTGTGGGC TGTCGCTGTG GTGGTGGCCG
CTGCACGCCG GCACCGGAAA CCTCGACCTC ACCCCGTCCT CACATTGGCC GGAGCCGACG
TTGCTGTTCG AGCCCGAGCC GCTCGACGGG CCCGTCCTGG TGATCACCGC CTACCGGGTG
CTGCCGGAGA ACGAGGAGCC GTTCATGTCC GCGATGGCCC GCCTGGGACG GTCGCGGCAG
CGCACCGGGG CCTCGATGTG GCAGCTGTTC CGCAGCATCG AGCAGGAGTC GACGTTCGTG
GAGACGTTCA TCGTGCGTTC CTGGGGTGAG CACATGCACC AGCACTACAC GCGGCTCACG
GGGCAGGATC AGCTGATCGA GCAGGACGTC GAGCGGTACA CCGAAGGCGA GGCCGTCGCC
GAGCACTATC TCGCGGTGCG CGACGCGCGC TGA
 
Protein sequence
MTSPVSTSTW APLQSPVFRA LWIAQFVSNL GTWMQTVGAQ WMLVDDPAAA VLVPLVQTAT 
TLPVMLLALP SGVLADLVDR RRLLIATQGA MAAGVGLLAT LTGAGLTTPT VLLTLLFVIG
CGQALTAPAW QAIQPDLVPS DQIPAAAALG SMSMNGARAI GPAIAGALVS LTGPTIVFAL
NAVSFVGIVL VLMWWRRPPV RSDYPPERAL AALSAGGRYI RSSPIVRRIL LRTVLFIAPG
SAVWGLLPVI ARDQLGLGSA GYGALLGALG VGAVLGAFAL SRLRARFGQN KLLIAGAAGF
GMATAVLALV HNVALVAAAL VVGGAAWLLT LSTLNASMQL SLPSWVRARG LSVYQLIFMG
GQAVGSLLWG LLAGGTSSVI SLLVSAGLLI FCGLSLWWWP LHAGTGNLDL TPSSHWPEPT
LLFEPEPLDG PVLVITAYRV LPENEEPFMS AMARLGRSRQ RTGASMWQLF RSIEQESTFV
ETFIVRSWGE HMHQHYTRLT GQDQLIEQDV ERYTEGEAVA EHYLAVRDAR