Gene Mvan_4060 details

Gene Information

Locus tag: Mvan_4060
Symbol:
ID: 4643483
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 4342554
End bp: 4343684
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 66%
IMG OID: 639807524
Product: hypothetical protein
Protein accession: YP_954843
Protein GI: 120405014
COG category:
COG ID:
TIGRFAM ID:
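
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the usual 1-based, inclusive coordinate convention and assumes the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 4342554
end_bp = 4343684

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS should contain a whole number of codons.
assert gene_length % 3 == 0

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1131 376
```

Both results agree with the listed Gene Length (1131 bp) and Protein Length (376 aa).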


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.271595
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTGTGC TCGATGCCTT CATGGCGATC TGGTCCAAAG CGCGCGGGAC TTTCGGTGAC 
GGCGTGCCGC AGGACGGATC GGTCTATGAC GCCAGTCCGG CGCTGACACG CCTGCAGGAC
GAGGTTCGCG GTGCCGCGCC GCGCGAGACA TGGACCGGGT CGGCTTCGGA CGGGTACGCA
TCGGCCAATG AACGGCACGC CTCGGTGCTG GGTGAGACCG CCCTTCTGGA CCGTCGGCTG
CGCGCCGAAA TCGACCGGTC GGCGGAGGTC GTCCGGGCCG GCAGGCGAGA ATTGGATGCG
GTGCGTCGGC GCGTGAGCGA GGCCGCCGCC GGTGTGCCGC CGACGCCGGA GGGTGAGAGC
ATGCTGTACC CGGCCATCAG CCGGGGCAGT GGCGAGATCG TCGACATCGT GCAACGCTCC
CACACCGACC TGAACATGAT CGCCGGTCGC ATCGAGGCGA TCGCGTCGGA GTACCAGGCG
CTCGGCGAGG GCACAGAGCT CGGGGTGGGC GACGGCGCCG AGCCGGCGGG AGGCGCTGAG
CCCGGCGATG AACCCCGACT TGCCGTGTCC AACGAGAACG AGCCATGGAC CTATCCGTTC
GATCCGCCCG CACCCCCGGA CTCCGCGCCC GGGGGCGGCA GGTGGGAACT CGGCCAGGCT
TACCCGCCGG GCCCGGGTGG CGGTCCTCCG ATGGGGCCTA TCCCGGCGCC GAAACCTTGG
CATCGCAGTA TCGATCCGCC GGTGAAAGGT GGTGGTTCGG GCTTGGAGGA TGTTGTCACG
CCCCCACCCA ACGGTGTGGG TGTGAGGCCG CCGCTCGTCT TGCAGGAGTC TTATGAGTTC
AGAGTGACTG GCGAGGGCTT TAGAGATGGT GACGGTCATC TACGGTGGAT ACAGCGCGAC
GGCAGCTGGT ATCAGGCCCA GTGGATCGAC TATGAGCTTG AGGCAAACCA TCTACAGCAA
CTCACCGGCA ACGTCTCTGT GCCGTTGGGA GACCATACCT GGGAGCCCAT CGACATCAAG
GACATTTACC AACTGCAGGT GGACAACCCA CGCCTAACCC TCTACATCCC GGATCCGTCT
GGCAGCGTTC TCGAACTCGA CCCTGATCGA CCTGCGGCAT CCGGACCGTG A
 
Protein sequence
MAVLDAFMAI WSKARGTFGD GVPQDGSVYD ASPALTRLQD EVRGAAPRET WTGSASDGYA 
SANERHASVL GETALLDRRL RAEIDRSAEV VRAGRRELDA VRRRVSEAAA GVPPTPEGES
MLYPAISRGS GEIVDIVQRS HTDLNMIAGR IEAIASEYQA LGEGTELGVG DGAEPAGGAE
PGDEPRLAVS NENEPWTYPF DPPAPPDSAP GGGRWELGQA YPPGPGGGPP MGPIPAPKPW
HRSIDPPVKG GGSGLEDVVT PPPNGVGVRP PLVLQESYEF RVTGEGFRDG DGHLRWIQRD
GSWYQAQWID YELEANHLQQ LTGNVSVPLG DHTWEPIDIK DIYQLQVDNP RLTLYIPDPS
GSVLELDPDR PAASGP
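
The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the method used to generate this record: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the set of alternative start codons is an assumption based on the commonly cited initiation codons for translation table 11. Under that table, a non-ATG start codon such as GTG is still read as methionine at the first position.

```python
from itertools import product

# Standard codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiation codons accepted under translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # Any accepted start codon is read as methionine.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("GTGGCTGTGC TCGATGCCTT CATGGCGATC"))  # MAVLDAFMAI
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison with the listed protein sequence and GC content.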