Gene Mvan_1220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1220 
Symbol 
ID4644267 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1307911 
End bp1309326 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content75% 
IMG OID639804718 
ProductGCN5-related N-acetyltransferase 
Protein accessionYP_952061 
Protein GI120402232 
COG category[R] General function prediction only 
COG ID[COG2144] Selenophosphate synthetase-related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.104607 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTTTCC AAGCGATCTC ACCCCACCGC GGCGAGTTGT CGATCCTGGC CGGGTCGCGG 
CCTGCGACCG GCGGGCAGTT CCTGATCCGC CTGGCCGGCG CGCCGGAGCT GCGGGCCTAC
CACCGGTTGC GGCAGGAGTC GTTCGTCGTC GAGCAGGGCA TGTTCAGCGG CACCGACCGT
GACGACCTCG ACGACGATCC CCGGACCGTG GTGCTGGTCG CCGTCGCCGC CGACGGCACG
GTGCTCGGCG GCGTCCGGCT GGCGCCGGCC TGCGCACCGG ACATCGGGTG GTGGACGGGC
AGCAGGTTGG TGGCCGACCC GGCCGCGCGC GCGGCGGGCC TGGGCCCGGC GCTGGTCCGC
GCCGCCTGCG CGCATGCGGA ATCGGTCGGC GCGCTGCGCT TCGAGGCCAC GGTGCAGCAC
CGCTACGCGC CGATGTTCGT CGGGCTGGGC TGGACCGACG AAGGCGGGTG TGTGGTGGCG
GGCCGGCCGC ACGTGGTGAT GCGATGGCCG CTCGATCCCG TTCAGCGCGT CGCGGCCGCG
ACGAAGTCCT TTCTCGGCGA GGTGCTCGAT CCGCTGCGTC GGGTGCCGAA CGGGCTGGGT
CCCAAGGGTT TCGTCGGCGA CGACGGCGTA CCGGTGCCGG GCGGTGACGT GGTCGCCGCG
TGCGACGCGA TCATCCCGTC GATGGTCGAA CGGGACCCGG AGTGGGCGGG CTGGTGTTCG
GTGCTCGTCA ACGTCAACGA CCTGTCGGCG ATGGGTGCGA CACCCACCGG CCTGCTCGAT
GCGGTCGGCG CGCCGAACCG GTCACTGCTG ACGCGCATCG TGCGCGGGGT GGCCAACGCC
TCGCAGGCGT GGCGGGTCCC GGTGCTCGGC GGGCACACCC AGCTCGGGGT GCCGGCATCG
CTGGCGGTGA CGGCGCTGGG CCGCACGGCC GATCCGGTGC CCGCGGCCGG CGGCGCGGCG
GGTGATGCGC TGCGTCTGAC CGTGGACCTG AGCGGCCGGT GGCGGCCGGG GTACCACGGC
AGGCAGTGGG ACTCGACGAG CGCCAGATGC CCCGACGACC TGGCGCGGAT GGGGTCCTAT
GTCGCCGCGG CCCGCCCGCG GGCGGCCAAA GACGTCAGCA TGGCCGGTAT CGCGGGGACG
GCGGGGATGC TCGCCGAGGC CGGCGGCGTC GGCGCCGAGA TCGACGTCGC CGCGGTCCCC
CGCCCCCGGG ACGCCGACAT GGGGTCCTGG ATGACGTGTT TCCCGGGCTT CGGGATGCTC
ACCGCGGGCG GGGTGGGGGC GGCGCCGTTG CCCGACGGGG TCGCCAGCCG GGTGTGTGGT
CGGCTGACGG CGCAGCCCGG GGTGCGGCTG CGCTGGCCCG ACGGGGTGGT GACGACGGCG
CTGCCCGCCG GCGTGACGGG GTTGGGACGG GCGTGA
 
Protein sequence
MLFQAISPHR GELSILAGSR PATGGQFLIR LAGAPELRAY HRLRQESFVV EQGMFSGTDR 
DDLDDDPRTV VLVAVAADGT VLGGVRLAPA CAPDIGWWTG SRLVADPAAR AAGLGPALVR
AACAHAESVG ALRFEATVQH RYAPMFVGLG WTDEGGCVVA GRPHVVMRWP LDPVQRVAAA
TKSFLGEVLD PLRRVPNGLG PKGFVGDDGV PVPGGDVVAA CDAIIPSMVE RDPEWAGWCS
VLVNVNDLSA MGATPTGLLD AVGAPNRSLL TRIVRGVANA SQAWRVPVLG GHTQLGVPAS
LAVTALGRTA DPVPAAGGAA GDALRLTVDL SGRWRPGYHG RQWDSTSARC PDDLARMGSY
VAAARPRAAK DVSMAGIAGT AGMLAEAGGV GAEIDVAAVP RPRDADMGSW MTCFPGFGML
TAGGVGAAPL PDGVASRVCG RLTAQPGVRL RWPDGVVTTA LPAGVTGLGR A