Gene Saro_1469 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1469 
Symbol 
ID3916134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1508743 
End bp1509897 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content60% 
IMG OID640444212 
Productluciferase-like 
Protein accessionYP_496746 
Protein GI87199489 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTTTC ACCTGATGCA GACCGGCGTT GTCGGGCGCC GTCATGAAAT CGAACAAGGC 
ATGGCTGGAC AGCGCCCCGA ACTATACCAG CGCTTTCTCG ATGAAGTGCG CGGATACGTT
CGGCTGGCCG ACGACCTTGG CTTCTACGGC TACTGTCAGC CCGAACACCA CCTCCAGATT
GAAGGCTTCG AGGCCAACAA CCATCCGGGC ATGTTCAGCC TCTTCGTCGG CCAGAACTGC
ACGAAGATGA AGGCAGGCAT CATGGGCTAC ACCATGACCA CGCACAATCC GGTGCGCGTT
GCCGAGGAAA TCGCCACGCT CGACCACATG TTGCAAGGCC GCCTCTACTG CGGCTTCACC
CGAGGCTATC ACGCCCGCTG GGTCGACGCC TATGGCATCA AGGAAGGTGT CAGCGCCACC
ACCCCAGACA ACGTAAAGGC GCGCGATGCG CAGGATGCGC TAAACCGCTC GCTGTTCGAG
GAAGGCGTGC GTGTGGTCAA GAAGGCTTGG ACAAACGACG TGTTCAACCA CAAGGGCGAG
AACTGGCAAT TCCCGCCGGA AGGCGGCTCC ACCGGGCATC CCGCTTACGC CAAATATGGG
AAGGGACAGG ACGAGAACGG CATCGTGCGC GAGATTGGCA TCGCTCCGCG CTGCTACCAG
GATCCCCACC CACCGATCTA CGGCGGCTTC GCAGCGTCTG GCCGGACCAT CGATTTCTGG
GCTGAAGAAG GCGGCAAGCT GATCGTACTG TCGGACAACC TCGACTTCTG CGAATCACTG
AACAACCGCT ACATCGCCAC GGCCGCCAAG AACGGCCGCG AAGTTACCCG CACCGACGCA
TCGGCCTGGG GTGGCTTCCT GATGCTGACC GACGACAAGG ACCGCGCTGA ATACCTGATG
AAGGAACACA TGTGGTTCTG GGACGAATGG TTCATTCCGC TCGGCCAGCG TCCGCCCAAC
GTGCTGATCG GCAGTGCGGA CGAAATTGCC GACAAGATCG GCCAGGCGCA TGATCGCCTC
GGCTTCGACG AACTTTTCCT GATGTTCGGG CAGGGTCATC TGGAGCCCGA AGCGAATCAG
GAAGAGCTTG AGAAATTCAT CAGCAAGGTA GCTCCGCGCT TCTCGACCAA GGACGAAAAC
GGCGTCTTCG TCTGA
 
Protein sequence
MKFHLMQTGV VGRRHEIEQG MAGQRPELYQ RFLDEVRGYV RLADDLGFYG YCQPEHHLQI 
EGFEANNHPG MFSLFVGQNC TKMKAGIMGY TMTTHNPVRV AEEIATLDHM LQGRLYCGFT
RGYHARWVDA YGIKEGVSAT TPDNVKARDA QDALNRSLFE EGVRVVKKAW TNDVFNHKGE
NWQFPPEGGS TGHPAYAKYG KGQDENGIVR EIGIAPRCYQ DPHPPIYGGF AASGRTIDFW
AEEGGKLIVL SDNLDFCESL NNRYIATAAK NGREVTRTDA SAWGGFLMLT DDKDRAEYLM
KEHMWFWDEW FIPLGQRPPN VLIGSADEIA DKIGQAHDRL GFDELFLMFG QGHLEPEANQ
EELEKFISKV APRFSTKDEN GVFV