Gene Ent638_1454 details

Gene Information

Locus tag: Ent638_1454
Symbol:
ID: 5114419
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 1607530
End bp: 1608675
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 59%
IMG OID: 640491640
Product: alkanesulfonate monooxygenase
Protein accession: YP_001176185
Protein GI: 146311111
COG category: [C] Energy production and conversion
COG ID: [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases
TIGRFAM ID: [TIGR03565] alkanesulfonate monooxygenase, FMNH(2)-dependent
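
The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical.

```python
# Values copied from the record above.
START_BP = 1607530
END_BP = 1608675


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1146, matching "Gene Length"
print(protein_length(length_bp))  # 381, matching "Protein Length"
```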


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.770918
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCTGA ATCTGTTTTG GTTTTTACCG ACCCACGGTG ATGGGCATTA TCTTGGTACT 
GAGGAAGGGG CGCGTCCGGT CGATTACGGC TACCTGCAAC AAATCGCGCA AGCCGCAGAC
AGAATTGGGT TCACCGGCGT GCTGATCCCC ACTGGCCGCT CCTGCGAAGA TGCCTGGCTG
GTGGCTGCGG CCATGATCCC TGTTACGCAG CGCCTGAAGT TTTTGGTCGC CCTGCGCCCA
AGCGTAGTTT CACCCACCGT TGCTGCACGC CAGGCCGCCA CGCTTGACCG ACTCTCCAAC
GGACGTGCGT TGTTTAATCT GGTCACCGGC AGCGACCCGA CGGAACTCGC GGGAGACGGC
GTTTTCCTTG ACCACACCGA ACGCTATGAG GCCTCGGCAG AATTCACCCG CGTCTGGCGT
CGTCTGCTTG AGGGCGACAC CGTCACCTAT GAAGGCAAAC ACATTCGCGT ACGTGATGCG
AAACTCTATT TCCCGCCCGT ACAGCAGCCG CGCCCTCCTC TTTACTTTGG CGGATCGTCG
GACGTTGCAC AGGATCTGGC GGCGGAACAG GTCGATCTGT ATCTCACCTG GGGCGAACCA
CCTGAACAGG TGAAAGAGAA AATTGAACAG GTCCGCGCCA AAGCTGCAGC GCATGGCCGT
AAAGTGCGTT TCGGCATCCG TCTGCACGTC ATTGTGCGAG AGACGAATCA GGAAGCCTGG
CAGGCCGCCG ATCGTCTGAT TTCACACCTT GATGATGACA CCATCGCCAA AGCGCAGGCC
GCCCTGGCCA AAACCGATTC AGTGGGCCAG CACCGGATGG CCTCCCTTCA CAACGGCAAA
CGCGAAAATC TCGAAATCAG CCCGAACCTG TGGGCAGGCG TTGGCCTGGT GCGCGGTGGC
GCAGGAACCG CGTTAGTGGG TGACGGCCCA ACGGTGGCGG CACGCATTAA CGAATATGCC
GATCTGGGGA TCGACAGCTT TATTTTGTCC GGTTATCCGC ATCTGGAAGA GGCGTATAAC
GTCGGCGAGC TGCTGTTCCC GCATCTGGAC GTCGCCATCC CGGAAATTCC GCAGCCGCGT
CCGCTGCAGG TTCAGGGCGA AGCAGTGGCG AATGAATTTA TTCCCCGCAA AACGGCACAG
AGCTAA
 
Protein sequence
MSLNLFWFLP THGDGHYLGT EEGARPVDYG YLQQIAQAAD RIGFTGVLIP TGRSCEDAWL 
VAAAMIPVTQ RLKFLVALRP SVVSPTVAAR QAATLDRLSN GRALFNLVTG SDPTELAGDG
VFLDHTERYE ASAEFTRVWR RLLEGDTVTY EGKHIRVRDA KLYFPPVQQP RPPLYFGGSS
DVAQDLAAEQ VDLYLTWGEP PEQVKEKIEQ VRAKAAAHGR KVRFGIRLHV IVRETNQEAW
QAADRLISHL DDDTIAKAQA ALAKTDSVGQ HRMASLHNGK RENLEISPNL WAGVGLVRGG
AGTALVGDGP TVAARINEYA DLGIDSFILS GYPHLEEAYN VGELLFPHLD VAIPEIPQPR
PLQVQGEAVA NEFIPRKTAQ S
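
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The sketch below shows one way to strip the grouping, translate the gene, and compute a GC fraction. It is an illustration under stated assumptions, not the method the database used: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard one, which is assumed here to give the same amino acid assignments as translation table 11 for a gene that starts with ATG.

```python
# Standard codon table built from the conventional TCAG ordering.
# Assumption: translation table 11 assigns the same amino acids to codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence, copied from the record above.
first_row = "ATGAGTCTGA ATCTGTTTTG GTTTTTACCG ACCCACGGTG ATGGGCATTA TCTTGGTACT"
print(translate(first_row))  # MSLNLFWFLPTHGDGHYLGT
```

The printed residues agree with the first two blocks of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.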