Gene Arth_1300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1300 
Symbol 
ID4446189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1459104 
End bp1460459 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content68% 
IMG OID639689108 
Productputative nitrilotriacetate monooxygenase 
Protein accessionYP_830794 
Protein GI116669861 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0896409 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATCCC ACCAGCGCAC CCTGCATCTC AACGCCTTCC TCATGAGTAC CGGCCACCAC 
GAAGCGTCCT GGCGGCTGCC CGAAAGCAAC CCCCGGGCGA GCACGGACAT CACCCACTAC
CGCAATCTCG CCCGGATCGC CGAGCGGGGC ACGTTCGATT CCATCTTCTT TGCCGACTCG
CCCGCGATCT TCGGAAACGT GGGCCGCCGG CCGGCCGGAA AGCTCGAACC GACAGTGCTG
CTGACCGCCA TCGCCGCAGC CACCGAGAAG ATCGGCCTCA TTGCCACGGC CTCCACCACG
TACAACGACC CGTTCAACCT CGCGAGGCGC TTTGCATCCG TCGACTTCGT CAGCGCGGGC
CGTGCCGGCT GGAACGTCGT CACTACCGCC GGGCCCGACG CCGCCCGGAA CTTTGGCGTC
GACGACCAGC CCGCGCACGC AACGCGCTAT GAGCGGGCGG CAGAGTTCCT GGACGTCGCC
GGCAAACTCT GGGACAGCTG GGACGATGAC GCCGTGGTGG CGGACAAGGA GGCGGGCGTC
TGGGCCGACC CGGACAAGGT CAGGCCGATC GACCACGTCG GAAAGCATTT CCGGGTTCGC
GGGCCGCTCA ACGTTCCGCG GTCCCCGCAG GGATACCCGC TGATTGTCCA GGCGGGCTCC
TCCGAAGACG GCAAGGGCCT CGCCGCGCGC TACGCCGAAG CCGTGTTCAC GGCGCAGCAG
ACGCTCGAGG ACGCGCAGAG CTTCTACAGC GACCTCAAGG CCCGCACCGC AGCCGCCGGA
CGAGACCCTG AAGGCATCAA GATCCTGCCC GGAATCGTGC CCGTGCTGGC AGGGACAGAA
GCCGAGGCGA AGAAGCTGGA GCGTGAACTC GACGAGCTGA TCCGTCCGGA ATATGCGCGG
ATAGAACTTG CCAAAACCCT CGGTGTCAGC CCGGACGACC TCCCGCTGGA CCGGCAACTG
CCGGCAGACC TTCCCGATGA GGATTCCATC CAGGGTGCCA AGAGCCGCTA CACCCTGATC
GTGGAGCTCG GCCGTCGCGA ACGGCTCACA GTGCGGCAGC TGATCGGCCG CCTGGGCGGC
GGCCGCGGAC ATCGCACGTT CTCGGGTACT CCGGAGCAGG TGGCCGATGC CATCCGGCTC
TGGTTTGAAA ACGGCGCGGC CGACGGCTTC AACATCATGC CCGCGGTGCT CCCCTCAGGG
CTTGAGGCCT TTGTGGACCA CGTGGTGCCG GTCCTGCGCC GGCGGGGCCT GTTCCGGACG
GAATACACAG CCGACACGCT GCGGGGGCAC TACGGACTGG AGCGCCCGCT CAACCGGTAC
TCCGGTGCTG CCGGACTGGC TGGCGCGCAC GCCTAA
 
Protein sequence
MASHQRTLHL NAFLMSTGHH EASWRLPESN PRASTDITHY RNLARIAERG TFDSIFFADS 
PAIFGNVGRR PAGKLEPTVL LTAIAAATEK IGLIATASTT YNDPFNLARR FASVDFVSAG
RAGWNVVTTA GPDAARNFGV DDQPAHATRY ERAAEFLDVA GKLWDSWDDD AVVADKEAGV
WADPDKVRPI DHVGKHFRVR GPLNVPRSPQ GYPLIVQAGS SEDGKGLAAR YAEAVFTAQQ
TLEDAQSFYS DLKARTAAAG RDPEGIKILP GIVPVLAGTE AEAKKLEREL DELIRPEYAR
IELAKTLGVS PDDLPLDRQL PADLPDEDSI QGAKSRYTLI VELGRRERLT VRQLIGRLGG
GRGHRTFSGT PEQVADAIRL WFENGAADGF NIMPAVLPSG LEAFVDHVVP VLRRRGLFRT
EYTADTLRGH YGLERPLNRY SGAAGLAGAH A