Gene Mvan_5688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5688 
Symbol 
ID4646209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp6077054 
End bp6078244 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content64% 
IMG OID639809164 
Producthypothetical protein 
Protein accessionYP_956459 
Protein GI120406630 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0203181 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCTTGT TTGATGACGT CTCGGGTGTC GTCGGCAATG TCGTGGACGT TGTCGAGGGC 
AGCGTGGGCA CTGTGGAGAG CGGTGTCGAC TTCGTCGGCA GCGTCCTCGA CGGTGATGTC
ACCGGGGCTA CCCGGGACGC CTACGAGTTT GTCGACAACG CCAGGGACGT TCTCGGTGGC
GTCCGCGACC TCGGTGTGAG CATCGGCCGG GTGCCGTCAC GCTTCGTCGA CAATCCGATT
CTGGAGCTGG CGAATTCGCC GCCGATCGAA CTTGCGCAGC GTGTCATCGC CGGCATGCGA
TTGACCACCG GATCCGGTGA CCCGGTCGAC GGAGAAGAGT TCAAGAACGC TGCCAAGCTG
ATGCAGGAAG CTTTGGAGCT TCTCATCGAC GCGGCTCCGC ATGGCGACCA TTGGAGCGGG
GCTGCCTCGG AGCAGTACGC GGGAGCGAAC ACGCAGAACA GGCGTGCAAC TTCGGGTGTC
CAAGTTGCCG ACTGGAATAT TGCGGACTTC CTCGAAGCCG AGGCCGAACA GGTCATGCGC
ACCCGCAAGA CTCTCGACGA CATCAATGAG TACCTTTTTC AGTTCGCGCT CTCGACGGCG
TGGATGAACG GTATTCCAGG CGGACGCGCG GCCAAGCTTG TCGTCGATGC GACCGCTGCC
GCCGGCGGTG CCCTCAGCGC AGAGCGCGCA TTACTCACAC TCGTCGCGGA CTCCGTTGAG
CGGGGCGGAA AGATCCGCGA TCAGTTCAAT CTCTATGAGG GCGCAGCCAA CCAGGAGCGA
AAACTCGATG ACCCAACGTG CGACGGGCCG TTCGTGCCGG CAAAGGTTGA TCGGAGCGAC
GCGGGGCGGC CGGGCCGCTT CGACAACCCG CACTACACCG TGCCTACACC CGTTGAACCT
CCGCAACACG GACCTGCGGC AACTCCTTAT ATCGGCGGGG GTTCATCGGT ACCAATGCCT
GCGCCTCAGA GCACTTCTCG AACTGCGCAT CCACAGGGGT CGGCACCACG ACCCTGGGCG
GGGACCAACC CGACCGCAGC CCCCGCCTCG CCCACACCGC AGCCGACCGG CGCACCCGGC
ACGCGCACTG GGTCGCTACC AGGACAAGGC GCCGGCAATC GTGGCGCCGC ACCTCTGAGA
GTTACAGCGC GGCCGACATC CGATGACAAG AGTCAGGAGA TTCCGACGTG A
 
Protein sequence
MSLFDDVSGV VGNVVDVVEG SVGTVESGVD FVGSVLDGDV TGATRDAYEF VDNARDVLGG 
VRDLGVSIGR VPSRFVDNPI LELANSPPIE LAQRVIAGMR LTTGSGDPVD GEEFKNAAKL
MQEALELLID AAPHGDHWSG AASEQYAGAN TQNRRATSGV QVADWNIADF LEAEAEQVMR
TRKTLDDINE YLFQFALSTA WMNGIPGGRA AKLVVDATAA AGGALSAERA LLTLVADSVE
RGGKIRDQFN LYEGAANQER KLDDPTCDGP FVPAKVDRSD AGRPGRFDNP HYTVPTPVEP
PQHGPAATPY IGGGSSVPMP APQSTSRTAH PQGSAPRPWA GTNPTAAPAS PTPQPTGAPG
TRTGSLPGQG AGNRGAAPLR VTARPTSDDK SQEIPT