Gene Mvan_0228 details


Gene Information

Locus tag: Mvan_0228
Symbol:
ID: 4647966
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 243506
End bp: 244936
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 66%
IMG OID: 639803737
Product: hypothetical protein
Protein accession: YP_951083
Protein GI: 120401254
COG category:
COG ID:
TIGRFAM ID: [TIGR02946] acyltransferase, WS/DGAT/MGAT
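
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(243506, 244936)
print(length, protein_length_aa(length))  # prints: 1431 476
```

Under those assumptions the computed values agree with the Gene Length (1431 bp) and Protein Length (476 aa) fields of this record.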


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.393674
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.149834
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGAGGC TTAGTGGCTG GGACGCTGTG CTGCTGTACA GCGAGACCCC GAACGTCCAC 
ATGCACACGC TCAAGTTGGC CGTCATCGAG CTCGACGACA CGTTCGCCCG GGAAGGCGGT
GCGACCTTCG GTGTCGAGGA GCTTCGCAAG GTCATCCACG GACGGCTCTA CAAACTCGAC
CCGTTCCGCT ACCAGCTGAT CGACATCCCG TTCAAGTTCC ACCACCCGAT GTGGCGGGAG
AACGCCGAGG TCGACCTCGA ATACCACGTT CGGTCATGCC GTGTCGATGC GCCGGGCGGT
CGTCGCGAGC TCGACGAGGC GGTGGGCAGG ATCGCGAGTA CCCCGCTGGA CCGCAGCCGA
CCGCTGTGGG AGATGTACCT GATCGAAGGT CTGGCGGGCG GCCGGATCGC GGTACTCGGA
AAGATCCATC ACGCCCTGGC CGACGGTGTC GCGTCGGCGA ACCTGCTGGC GCGCGGCATG
GATCTGCAGG ACAGCCCGCA GGCCGACCGG GACTCCTACG CCACCGACCC GGCCCCGACC
CGGGGCGAGC TGGTCCGGTC GGCGTTCACC GATCATCTCC GGCAGATCGC CAAGCTGCCC
GGGGTGGTGC GCTACACCGC CCAGGGGGTG CGTCGGGTGC AGCGCAGCGA GCGCAAGCTC
TCGCCCGAGC TGACGCGACC GTTCACCCCG CCGCCGACGT TCATGAACCA CATGGTCGAC
GCCACCCGCA GGTTCGCCAC CGCCACCGTG GCGCTCGACG ACGTCAAGCA GACCGGCAAG
CAGCTGGGCG TCACCATCAA TGACATGGTG CTGGCGATGT CCGCAGGGGC ATTGCGAAAG
TTGTTGCTGC GGTACGACGG TCGTGCCGAT CATGCGCTGC TGGCGTCGGT GCCGGTGAGT
TTCGACTTCT CCCGCGACCG GATCTCCGGT AACTACTTCA CCGGTGTGCT GGTCAGCCTC
CCGGTGGACG TCGAGGATCC GCTGGAACGG GTCAGCGCCG CCCACACCGC CGCGGCGGCG
GGCAAGGAGA GCAACAACCT GATCGGTCCC GAGTTGGTCA GCCGGTGGTC GGCTTATTTC
CCGCCGGCCC CGGCCGAGGC GATGTTCCGC TGGCTGTCGA ACAAGGATGG CCAGAACAAG
GTGATGAACC TGCCGATCTC CAACGTGCCG GGTCCCCGCG AGCGCGCCCG TGTCGGCGGT
GCGTTGGTCA CCGAGATCTA CTCCGTCGGC CCGCTCACCG CGGGCAGCGG CCTCAACATC
ACCGTGTGGA GCTACGTCGA CCAGATCAAC ATCTCGGTGC TTTCGGACGG CAAGACGCTC
GACGATCCCC ATGAGCTCAC CACGGCCATG GTCGACGAGT TCATCGAGAT ACGCCGTGCC
GCAGGACTTT CCACGGAGCT GACGGTGATC GAAACGGCGA TGGCCAACTA G
 
Protein sequence
MKRLSGWDAV LLYSETPNVH MHTLKLAVIE LDDTFAREGG ATFGVEELRK VIHGRLYKLD 
PFRYQLIDIP FKFHHPMWRE NAEVDLEYHV RSCRVDAPGG RRELDEAVGR IASTPLDRSR
PLWEMYLIEG LAGGRIAVLG KIHHALADGV ASANLLARGM DLQDSPQADR DSYATDPAPT
RGELVRSAFT DHLRQIAKLP GVVRYTAQGV RRVQRSERKL SPELTRPFTP PPTFMNHMVD
ATRRFATATV ALDDVKQTGK QLGVTINDMV LAMSAGALRK LLLRYDGRAD HALLASVPVS
FDFSRDRISG NYFTGVLVSL PVDVEDPLER VSAAHTAAAA GKESNNLIGP ELVSRWSAYF
PPAPAEAMFR WLSNKDGQNK VMNLPISNVP GPRERARVGG ALVTEIYSVG PLTAGSGLNI
TVWSYVDQIN ISVLSDGKTL DDPHELTTAM VDEFIEIRRA AGLSTELTVI ETAMAN
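
The gene and protein sequences above are related by translation table 11, and the record also reports a GC content. As a minimal sketch of how that relationship could be reproduced, the code below translates a space-separated DNA block like the one shown and computes a GC fraction. It is not the database's own pipeline. The codon-to-residue mapping used is the standard one, which table 11 shares. The `START_CODONS` set is an assumption covering only three common bacterial initiators (a subset of what table 11 permits), and all function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard codon mapping in TCAG order; table 11 uses the same assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a subset of the initiator codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate_cds(seq: str) -> str:
    """Translate a complete or partial CDS that begins at its start codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODON_TABLE[c] for c in codons]
    # An alternative initiator such as GTG is still read as methionine.
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    protein = "".join(residues)
    # Drop a single terminal stop codon if present.
    return protein[:-1] if protein.endswith("*") else protein


# First 18 bases of the gene sequence shown above.
print(translate_cds("GTGAAGAGGC TTAGTGGC"))  # prints: MKRLSG
```

The first codon of this gene is GTG, which is why the initiator handling matters: without it the translation would begin with V instead of the M shown in the protein sequence.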