Gene Mvan_3605 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3605 
Symbol 
ID4647170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp3840020 
End bp3841573 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content69% 
IMG OID639807079 
ProductCHAD domain-containing protein 
Protein accessionYP_954403 
Protein GI120404574 
COG category[S] Function unknown 
COG ID[COG5607] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.673296 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.196907 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACGCT ATGGTCCCCG AAACCACGGC AGAGACAGGG TGACCATGGC CGCCAATTCG 
CCCGAAACCT CCAGGCACAC GGAGACCGAG CGCAAATTCG AGGTGGTGGA GGCGACCGTC
TCCCCCTCTT TCGACGGGCT GTCGGCGGTC GGCCGTGTCC AGCGGCTGAC GTCACAATCC
CTCGATGCCG TGTACTTCGA CACACCCGGG CGTGACCTCG CCCGACATCG CATCACGCTG
AGGCGCCGCA CCGGCGGGAC CGACGCCGGG TGGCATCTGA AGCTGCCCGC GGGCGCGGAC
AGCCGAACCG AGGTGCACGC CCCCCTGTCC GACGACGGCG ACGCGGCGCA CCCGACCGTG
CCGGAGAGCC TGCGCGACAT CGTGCTGGCG ATCGTGCGGG ACCGTCCGCT GGCCCCGGTC
GCGCGTATCA GCACCCTCAG GAACGTCGAC GTGCTCTACG GGCCGGACGG GTCGGCGGTC
GCCGAGTTCT GTGATGACCA GGTGACCGCG AGCGCTGTCG GTGGCGACGA GCAGCACTGG
CGGGAGTGGG AACTCGAGCT GGGCCCCGGC ACCGGCTCCG AGGTGTTCGA CCGGCTCACC
AACCGTTTGC TGGATGCGGG TGCGCGGCCC GCCGGGCACG GCTCGAAACT GGCCCGCGTA
CTCGAGTCCA GCGCGCCGGA GGTGGAGGAC ACCGAAGCGG TCGCGGCGCC GACCGATCCG
GCGCGCCGCG CGGTGGCCGT CCACCTCGAA GAACTCATCG AATGGGATCG TGCGGTCAGA
GCGGATGCCT GGGACTCGGT GCATCAGATG CGGGTGACCA CCCGCAAGAT CCGCAGCCTG
CTTCAGTCGT CCGAAGGCTC ATTCGACATC GCCGATCACG AATGGGTCCT CGACGAGCTG
CGCGAGTTGG CCGCCGTGCT CGGTGTCGCC CGGGACGCCG AGGTGCTGGC CGAACGCTAT
CAACGCGCCA TCGACAAACT GCCCGAGGCG AACGTCCGTG GACCGGTGCG GGAAAGGCTC
GTCGACGGCG CCGAGAAGCG GTATCAGACG GGATGGAAGC GGTCGCTGAC CGCGATGCGT
TCGCAGCGCT ACTTCCGGTT GCTCGACGCG CTCGAGGAGC TGATCGCCAC CGAGCCTGTC
GCGTCCGGCC GGGGTGAGAC ACCGGCGGAG CTGACCATCG ACTCCGCGTA CAAGCGGGTC
CGGAAGGCCG CCAAGGCCGC TCGCGCCGCG GCCGCAGACG CCAAGACCGA GTCCGACGAG
GCGCTGCACC GGATCCGCAA GCGCGCCAAG CAGCTCCGCT ATACCGCCGC GGCCATCGGC
GAGAACAAGG TGGCCGAGCG GGCGAAGGTG ATCCAGTCGC TGTTGGGTGA CCACCAGGAC
AGCGTGGTCA GCCGGGCACA TCTGAGCCGG CAGGCCCAGG TGGCCCACGC CGCGGGCGAG
GACACCTTCA CCTACGGGCT GCTGTACCAG CAGGAGGACG ACCTCGCGTT GCGCTGCGAG
GAGCAGATCG AGGATGCGCT CCGGCAGCTC GACAAATCGG TGGGCCGGCG TTAG
 
Protein sequence
MARYGPRNHG RDRVTMAANS PETSRHTETE RKFEVVEATV SPSFDGLSAV GRVQRLTSQS 
LDAVYFDTPG RDLARHRITL RRRTGGTDAG WHLKLPAGAD SRTEVHAPLS DDGDAAHPTV
PESLRDIVLA IVRDRPLAPV ARISTLRNVD VLYGPDGSAV AEFCDDQVTA SAVGGDEQHW
REWELELGPG TGSEVFDRLT NRLLDAGARP AGHGSKLARV LESSAPEVED TEAVAAPTDP
ARRAVAVHLE ELIEWDRAVR ADAWDSVHQM RVTTRKIRSL LQSSEGSFDI ADHEWVLDEL
RELAAVLGVA RDAEVLAERY QRAIDKLPEA NVRGPVRERL VDGAEKRYQT GWKRSLTAMR
SQRYFRLLDA LEELIATEPV ASGRGETPAE LTIDSAYKRV RKAAKAARAA AADAKTESDE
ALHRIRKRAK QLRYTAAAIG ENKVAERAKV IQSLLGDHQD SVVSRAHLSR QAQVAHAAGE
DTFTYGLLYQ QEDDLALRCE EQIEDALRQL DKSVGRR