Gene Arth_3897 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3897 
Symbol 
ID4445098 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4388924 
End bp4389913 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content67% 
IMG OID639691722 
Productluciferase family protein 
Protein accessionYP_833372 
Protein GI116672439 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR03558] luciferase family oxidoreductase, group 1 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACTGTTC CGCTTTCCAT CCTTGACCTG GCCACCATCG CCAAGGACCA GACCGCAGCC 
GAGAGCTTTG CCGGCAGCGT GGCCATGGCG CAGCGGGCGG AGGAGCTGGG GTACCGCCGG
GTCTGGTACG CGGAGCACCA CAACATGTCC TCTATCGCTT CGTCGGCAAC GAGCGTGCTG
ATCGCCCACA TCGCGGCCAA CACCAGCACC ATCAGGCTGG GTGCCGGCGG CGTGATGCTG
CCCAACCACT CCCCGCTCAC CATCGCCGAG CAATTCGGCA CGCTGGAGAC CCTGCATCCG
GGCCGCATCG ACCTCGGTCT GGGCCGTGCG CCCGGCAGTG ACCAGAACAC CATGCGAGCA
CTCCGGCGCG ACCCGATGTC CGCGGACAGC TTCCCGCAGG ATGTCCTGGA ACTGCAGGGC
TACCTCACCG GCCCCACCCG CATCCAGGGC GTTGAGGCGA CGCCGGGCAA GGGCACCAAC
GTGCCCCTGT ACATTCTGGG CTCCTCGCTT TTCGGGGCCC GGCTCGCAGC GCAGCTTGGA
TTGCCGTACG CCTTCGCTTC GCACTTCGCT CCCAATGCCC TGCAGGAAGC GGTGGCCATC
TACCGGCGGG AATTCAAGCC CTCGGCCCAG CTGGATGCCC CGCACGTGAT TGCAGGCGTC
AACGTGATAG CGGCCGATTC CGCCTCCGAA GCCCAAGAGA TGTTCCAGGC CACCAAGCGC
GCCCGCGTGT CCCTGTTCTT TGGCAACGGC AGGGTGTTCA CCGACGACGA GGCGGACATG
ATCCTCGACT CGCCGCAGGG CCAGCACGTA GCCCAGATGA TGAAGTACTC GGCGATCGGG
ACCCCTGACG TGGTGATGGA CTACCTGGAC GAGTTTGCCG CCCACGCTGA TGCGGACGAA
CTGATCGTGG CCCACCAGAG CAACGGAACC GAGGCCCGCC TGCGGTCCGT GGAACTCCTC
GCCAGCGCTG CGGGGCTGGT CCGCGCCTAG
 
Protein sequence
MTVPLSILDL ATIAKDQTAA ESFAGSVAMA QRAEELGYRR VWYAEHHNMS SIASSATSVL 
IAHIAANTST IRLGAGGVML PNHSPLTIAE QFGTLETLHP GRIDLGLGRA PGSDQNTMRA
LRRDPMSADS FPQDVLELQG YLTGPTRIQG VEATPGKGTN VPLYILGSSL FGARLAAQLG
LPYAFASHFA PNALQEAVAI YRREFKPSAQ LDAPHVIAGV NVIAADSASE AQEMFQATKR
ARVSLFFGNG RVFTDDEADM ILDSPQGQHV AQMMKYSAIG TPDVVMDYLD EFAAHADADE
LIVAHQSNGT EARLRSVELL ASAAGLVRA