Gene Mvan_1404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1404 
Symbol 
ID4646424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1494568 
End bp1495674 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content71% 
IMG OID639804904 
Producthypothetical protein 
Protein accessionYP_952244 
Protein GI120402415 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.582743 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCATTG ACCGTCTCCG TTCCTTCGCC GATGCGTCCG GCCCGTTCGT GTCGCTCTAC 
GTCGACGACA CGAGGGACAA TCCCGACGCC GAGAAGCGGG CCACCATCCG GTGGCAGGCG
ATTCGGCGGA ACCTCGAAGA CAGCGGCGCC GCCGAGCACA TGATCGGCGC CGTCGAACGT
GCGCTGCTGC ACAGTCAGCC GGGCGTGGGA CGCGGCGGCC GTGCGGTGAT CGCAGGGCGC
GAGGGCGTCC TGCTCAACGA GCACCTCGGT GCGCCGCCGT CGATCACGGT GCTGCGGGTC
TCGGAGTACC CGTACGTCCT GCCCCTGCTC GACCTCACGA CCAGCAATCC CGCGTACGTG
TTCGCCGCCG TCGACCATCT GGGCGCCGAC CTCACCGGGC ACCGCAACGG ACTGGTGCAC
CGCGAAACGG TCGACGGACA CGGTTATCCG GTGCACAAGC CGGTCTCCGC CGGATGGCAG
GGATACGACG ATCATGAGCG CTCGGCCGAG GAGGCCGTTC GGATGAATGC CCGCGCGGTC
GCCGACCGCA TCACCGACCT CGTCGACCGC GGTGGCGGCG AACTCGTGTT CCTGTCGGGC
GAAGTCCGGG CCCGTACCGA TGTCGTGGCC GCTCTGCCGC AACGCATCGC CGACCGCGTC
GTCACGCTCC CGGCCGGCGC CCGCGGCGGC CGGGCCACCG AACGCGAGCT GGCCGGCGAG
ATCGACGCCG AATTCGCGCG CCGCCAACGC GACGGGGCGA ACGCCGCCCT GGCGCGGTTC
AAGGCCGAGT CCGCGCGCAA CTCCGGATTG GCCGTCGAAG GCCTTCCCGA CGTCTGCACG
CCGCTGCGGG CCGGCTCCGT GGGCGCGCTG ATCGTCGGCG ACATGGGGAG TGCCACGGTC
GTCTGTGGCC AGAGCCGCAC CACGATCGCG CCCGACGCCG ACACGTTGTC CGAACTCGGT
GAAGCGCCGA GCGGCGTCGT GCTGGCCGAC GAAGCCGTGC CGTTCCTGGC GATCTCGACC
GATGCCACGG TGGTGCGGGC CGGCGACGGG GCGCAGCTCA CCGACGGGAT CGCGGCGGTG
CTGCGTTATC CGCTCAGGGT TGCATGA
 
Protein sequence
MPIDRLRSFA DASGPFVSLY VDDTRDNPDA EKRATIRWQA IRRNLEDSGA AEHMIGAVER 
ALLHSQPGVG RGGRAVIAGR EGVLLNEHLG APPSITVLRV SEYPYVLPLL DLTTSNPAYV
FAAVDHLGAD LTGHRNGLVH RETVDGHGYP VHKPVSAGWQ GYDDHERSAE EAVRMNARAV
ADRITDLVDR GGGELVFLSG EVRARTDVVA ALPQRIADRV VTLPAGARGG RATERELAGE
IDAEFARRQR DGANAALARF KAESARNSGL AVEGLPDVCT PLRAGSVGAL IVGDMGSATV
VCGQSRTTIA PDADTLSELG EAPSGVVLAD EAVPFLAIST DATVVRAGDG AQLTDGIAAV
LRYPLRVA