Gene Mvan_4564 details


Gene Information

Locus tag: Mvan_4564
Symbol:
ID: 4649028
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 4903575
End bp: 4905206
Gene Length: 1632 bp
Protein Length: 543 aa
Translation table: 11
GC content: 67%
IMG OID: 639808034
Product: SSS family solute/sodium (Na+) symporter
Protein accession: YP_955345
Protein GI: 120405516
COG category: [R] General function prediction only
COG ID: [COG4147] Predicted symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family
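
The coordinate and length fields in this record are related by simple arithmetic. A minimal Python sketch of that consistency check follows. It assumes the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(4903575, 4905206)
print(length_bp, protein_length(length_bp))
```

With the values listed for Mvan_4564, the computed lengths agree with the reported Gene Length (1632 bp) and Protein Length (543 aa).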


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.252386
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.41587
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACCACCC TGCTGGCCCA GACCGAGACC ATCGGCAACC CGGTCGCCAA CATCGGCATC 
TTCAGCCTGT TCGTCGTCGT CACGATGATC GTGGTGATCC GGGCGAGCAA GCGCAACGCC
ACCGCCGACG AGTTCTTCAC CGGCGGGCGC GGCTTCTCCG GCCCGCAGAA CGGCATCGCC
ATCGCCGGTG ACTATCTGTC GGCCGCCAGC TTCCTCGGCA TCGCCGGCGC CATCGCGGTC
TACGGCTACG ACGGCTTCCT GTACTCGATC GGCTTCCTGG TGGCGTGGCT GGTGGCTCTG
CTCCTGGTGG CCGAATTGCT ACGTAACACA GGCAGATTCA CGATGGCCGA CGTGCTGAGC
TTCCGCCTCA AGCAGCGGCC GGTGCGGTTG GCCGCGGCCA CCAACACGTT GACGGTGTCG
CTGTTCTACC TGCTGGCCCA GATGGCCGGC GCCGGTGGCC TGGTGGCGCT GCTTCTGGAC
ATCAACAGCC GCACCGGACA GTCCGTCGTG ATCGCCGTGG TCGGCGTGCT GATGATCGTC
TACGTCCTGG TCGGCGGCAT GAAGGGCACC ACCTGGGTGC AGATCATCAA GGCCGTCCTG
CTGATCGCCG GCGCGGGCTT CATGACGGTC ATGGTGCTCG CGAAGTTCGG GATGAACTTC
TCCGAGATCC TCGGCTCGGC GCAATCCGCC ATCAGCGGTT CGACCACCAC CGGCGTCGCC
GGCCGCGACG TGCTGGCCCC CGGTGCGCAG TACGGCGGGT CGCTGACCTC GCAGATCAAC
TTCATCTCGC TGGCGATCGC GCTGGTGCTC GGCACCGCGG GCCTGCCGCA CGTGCTGATG
CGCTTCTATA CGGTGCCGAC CGCCAAGGAG GCACGACGAA GCGTCGTCTG GGCGATCGCG
CTGATCGGCG CGTTCTACCT GTTCACCCTG GTGCTGGGCT ACGGCGCGGC AGCTCTGGTG
GGTCCCGACC GCATCCTGGG CGCGGCGGGC GGGGTGAACT CGGCGGCTCC GCTGCTGGCG
TTCGAGCTCG GTGGGGTGGT GCTGCTTGGG GTCATCTCCG CGGTGGCGTT CGCGACGATC
CTGGCTGTCG TGGCGGGGCT GACGATCACC GCCTCGGCGT CGTTCGCCCA CGACATCTAC
GCCAGCGTGA TGAAGTCGCA CAAGGTGACC GAGGCCGAAC AGGTCCGGGT CTCCCGGATC
ACCGCGGTCG TGCTCGGTGT GCTCGCCATC GGCCTCGGCA TCCTGGCCAA CGGGCAGAAC
ATCGCGTTCC TGGTGGCGCT GGCGTTCGCG GTCGCCGCGG CGGCCAACCT TCCGACGATC
ATCTACTCGC TGTACTGGAG GCGTTTCAAC ACCCGCGGCG CGCTGTGGAG CATGTACGGC
GGGCTGATCT CGACCATCGT GCTGATCGTC TTCTCCCCCG CGGTGTCGGG ATCGAAGACC
GCGATGATCC CGGGTGCGGA CTTCGCCTGG TTCCCGCTGG CCAACCCGGG CATCGTGTCG
ATCCCGCTGG CGTTCATCCT CGGCATCGTC GGCACCCTGA CCTCACCGGA CGACGAGGAT
CCGAAGGTCG CCGCCGAGAT GGAGGTCCGC TCGCTGACCG GCATCGGCGC CGAGAAGGCG
GTCGCCCACT AG
 
Protein sequence
MTTLLAQTET IGNPVANIGI FSLFVVVTMI VVIRASKRNA TADEFFTGGR GFSGPQNGIA 
IAGDYLSAAS FLGIAGAIAV YGYDGFLYSI GFLVAWLVAL LLVAELLRNT GRFTMADVLS
FRLKQRPVRL AAATNTLTVS LFYLLAQMAG AGGLVALLLD INSRTGQSVV IAVVGVLMIV
YVLVGGMKGT TWVQIIKAVL LIAGAGFMTV MVLAKFGMNF SEILGSAQSA ISGSTTTGVA
GRDVLAPGAQ YGGSLTSQIN FISLAIALVL GTAGLPHVLM RFYTVPTAKE ARRSVVWAIA
LIGAFYLFTL VLGYGAAALV GPDRILGAAG GVNSAAPLLA FELGGVVLLG VISAVAFATI
LAVVAGLTIT ASASFAHDIY ASVMKSHKVT EAEQVRVSRI TAVVLGVLAI GLGILANGQN
IAFLVALAFA VAAAANLPTI IYSLYWRRFN TRGALWSMYG GLISTIVLIV FSPAVSGSKT
AMIPGADFAW FPLANPGIVS IPLAFILGIV GTLTSPDDED PKVAAEMEVR SLTGIGAEKA
VAH
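
Both sequences are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The sketch below shows one way to do that in Python and to compute a GC fraction from the joined nucleotide string. The function names are hypothetical, the input is assumed to contain only sequence letters and whitespace, and the example string is just the first two and last two blocks of the gene sequence, not the full gene.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already joined nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Illustrative fragment only: first two and last two blocks of the gene sequence.
fragment = join_blocks("ATGACCACCC TGCTGGCCCA \nGTCGCCCACT AG")
print(len(fragment), fragment[:3], fragment[-3:])
```

Applied to the full gene sequence, `join_blocks` should return a string whose length matches the reported Gene Length, and `gc_fraction` can be compared against the reported GC content.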