Gene Vapar_3871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_3871 
Symbol 
ID7969728 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp4100395 
End bp4101504 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content69% 
IMG OID644794457 
ProductAlkanesulfonate monooxygenase 
Protein accessionYP_002945751 
Protein GI239816841 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAGA ACGACGTTGA ATTCATCGGC ATGATCCAGG GCCAGAAGGT CTCGGAGATC 
CACGCGGCCA AGGGCCCGGC CATCGACCGC GACTACGTGC GCGCCTTTGC CCAGGCGCAC
GAGAACGCGG GCTTCGACCG CGTGCTGGTG CCGCACCATT CCACCGGCCC CGACGCCACG
CTGACCGTGG CCCATGCCGC ATCGGTCACC GAGCGCATCC ACTTCATGCT TGCGCACCGC
CCGGGCTTCG TGGCGCCCAC GCTCGCGGCG CGCCAGTTCG CCTCGCTCGA CCAGTTCAGC
GGCGGCCGGC TCGCGGTGCA CTACATCTCG GGCGGCTCCG ACGAGGAACA GCGCCGCGAC
GGCGACTGGC TCGGCCATGA CCAGCGCTAT GCGCGCACCG ACGAATACCT CGACGTGCTG
CACAAGGTGT GGACCGCCGA CAAGCCCTTC GACCATGAAG GCGCGCACTA CCGCTTCCAG
AACGCTTTCT CCGAAGTGAA GCCGCTGCAG ACGCGCAACG GCAAGCCGCA TGTGCCGGTG
TACTTTGGCG GTGCCTCGGA GGCGGCGATT CCGGTGGCCG GCAAGTACGC CGACGTGTAT
GCGCTGTGGG GCGAGTCGCT CGACCAGGCG CGCGAACTCA CGGCGCGCGT GCGTGCAGAG
GCCGCGAAGC ACGGGCGCAG CGTGCGCTTC TCGGTGTCGT TCCGCCCGAT CCTGGCGCAG
ACCGAAGAGG CCGCATGGGC GCGCGCCGAG AACATCCTGG CCGAGACGAA GCGCCTGCGC
GTGGTGCAGG GCTACAACCG CGGCGGGCCG CAGCAGAGCG AAGGTGCGAA GCGCCTGCTC
GCCGCGGCCG AGAAGGGCAC GCGGCTCGAC AAGCGGCTGT GGACGGCGGT GGCGCAGGAG
ATCGGCGGCC GCTCCAACAG CACGGCGCTG GTCGGCACGC CCGAGCAGGT GGCCGATGCG
CTGCTCGACT ACTACGACCT GGGCGTGACC ACCTTCCTGA TCCGCGGCTT CGATCCGCTG
GAAGACGCGG TCGACTATGG ACGCGAGCTG ATTCCGCGTG TGCGCGAGCT GGTGGCGCAG
CGTGCGGCTT CGGCTCGCAA AGCGGCGTGA
 
Protein sequence
MSQNDVEFIG MIQGQKVSEI HAAKGPAIDR DYVRAFAQAH ENAGFDRVLV PHHSTGPDAT 
LTVAHAASVT ERIHFMLAHR PGFVAPTLAA RQFASLDQFS GGRLAVHYIS GGSDEEQRRD
GDWLGHDQRY ARTDEYLDVL HKVWTADKPF DHEGAHYRFQ NAFSEVKPLQ TRNGKPHVPV
YFGGASEAAI PVAGKYADVY ALWGESLDQA RELTARVRAE AAKHGRSVRF SVSFRPILAQ
TEEAAWARAE NILAETKRLR VVQGYNRGGP QQSEGAKRLL AAAEKGTRLD KRLWTAVAQE
IGGRSNSTAL VGTPEQVADA LLDYYDLGVT TFLIRGFDPL EDAVDYGREL IPRVRELVAQ
RAASARKAA