Gene Mvan_1137 details


Gene Information

Locus tag: Mvan_1137
Symbol:
ID: 4646701
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 1208513
End bp: 1209793
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 65%
IMG OID: 639804636
Product: HipA domain-containing protein
Protein accession: YP_951979
Protein GI: 120402150
COG category: [R] General function prediction only
COG ID: [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes
TIGRFAM ID: [TIGR03071] HipA N-terminal domain
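
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1208513, 1209793)
print(length)                     # 1281
print(protein_length_aa(length))  # 426
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of the record.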


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.965885
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGACCC GAACCCTGCG GGTCTACCTC GATGGAACCC CGATCGGCAC CATGACGCAA 
TCCAGCCATG GTGCTTTGGG TTTCACCTAC GACGGCGCGT ACACCGGGCA GGACGACCCG
ACACCGCTGT CGCTGTCGAT GCCGATCCCT TCCTCCCGGC ATCGGGACAA AGCAGTGCGG
GCCTACCTGG AGGGCTTGCT TCCGGACCGC GAAGGTGTGC GACAGCGATG GGCACGCGAG
TACAGCGTGT CGCCGAACAA CCCGTTCGGG CTGCTGGCGC ACGTCGGTCG CGACGCCGCC
GGCGCCGTCC AAATCCTTCC GCCCGACCTC GACCCGGCCG ACGCCCGCGC CTGCGACGGC
GACATCCAAT GGCTCAGCGC GGCCGACCTC TCAGATCTCG CCCGAGATCT CACCACCCAC
CAATCCGACT GGAATCCAGG CAGATTCGAG GGTCGGTGGA GCCTCGCCGG CGCACAACCG
AAAATCGCGC TGTTCCGAGA CACGAAGTCC GGACGGTTCG GCATCCCGCG TGATTCCACA
CCGACGACGG CGATCCTCAA GCCCGCACTC GTCGGCTACA CACAACATCA CATCAACGAG
GCGTTGTGTC AGCGTGCCGC ACGTGAGGCT GGGCTGCTCG CCGCCGAATC CGAGCTGACG
CAGATCGGTG AGGTGCAGGT GTTGATCTCG ACGCGCTACG ACCGCCGTCA CGACGGGACG
TTGTGGCATC GGGTCCACCA GGAAGACATG TGTCAGGCGT TGTCGGTCCA CCCCGCCCTC
AAATACCAGT CCGATGGAGG GCCAGGTGTC GGTGATGTCG CCGACCTGCT CAACAGGCTC
CCGGTCGAGG ACCGCGCTGT GAATGCCGAG CGATTCTTCA AGGCGCTCAC CTACAACGTC
CTGATCGGCG GCACCGACGC TCACGCGAAG AACTACTCAC TTGTCCTCAT GGGATCACGC
GCCCAGGTGG CGCCCATGTA TGACGCCGCC TCGGCTGCGC CGTACGACCA GCGCGACCAC
CTGCGTTCCT CCATGAAGAT CGGTGAACAC TGGAAAATGC TCGATGTCAA CAATTCCGAC
TGGGCCAAGG TGGGACGCCG TCTCGGCATC TCCGCGGAGC AGGCCACGGC GTGGGTAGGC
GAACTCCGCA ACAAACTTCC GGATGCATTC GAGCGCGCCG TGGCTTCACT GGCGCTGAGC
GCACGACCCG AGGCGGGACG CATGGCCGAG CGGATCATTG AGCACGTCGC GGGTACCTGG
AAGCCCACTC TGCCTCGCTG A
 
Protein sequence
MATRTLRVYL DGTPIGTMTQ SSHGALGFTY DGAYTGQDDP TPLSLSMPIP SSRHRDKAVR 
AYLEGLLPDR EGVRQRWARE YSVSPNNPFG LLAHVGRDAA GAVQILPPDL DPADARACDG
DIQWLSAADL SDLARDLTTH QSDWNPGRFE GRWSLAGAQP KIALFRDTKS GRFGIPRDST
PTTAILKPAL VGYTQHHINE ALCQRAAREA GLLAAESELT QIGEVQVLIS TRYDRRHDGT
LWHRVHQEDM CQALSVHPAL KYQSDGGPGV GDVADLLNRL PVEDRAVNAE RFFKALTYNV
LIGGTDAHAK NYSLVLMGSR AQVAPMYDAA SAAPYDQRDH LRSSMKIGEH WKMLDVNNSD
WAKVGRRLGI SAEQATAWVG ELRNKLPDAF ERAVASLALS ARPEAGRMAE RIIEHVAGTW
KPTLPR
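
As a rough illustration of how the gene and protein sequences relate, the sketch below translates a DNA string codon by codon and computes a GC percentage. It is a minimal sketch, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the codon assignments are the standard ones, which translation table 11 shares for elongation (table 11 differs in its permitted start codons, which this sketch ignores).

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard
# amino-acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("ATGGCGACCC GAACCCTGCG GGTCTACCTC"))  # MATRTLRVYL
print(translate("AAGCCCACTC TGCCTCGCTG A"))           # KPTLPR
```

The two fragments reproduce the first ten and last six residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.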