Gene Daro_1414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1414 
Symbol 
ID3569099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp1543076 
End bp1544548 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content59% 
IMG OID637679882 
Productnitrogenase molybdenum-iron protein alpha chain:nitrogenase component I, alpha chain 
Protein accessionYP_284633 
Protein GI71907046 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones59 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.13472 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACTC TGTCTCGCGA AGAAACCGAA GCCCTCATTG CCGAGGTGCT TGAGGTTTAT 
CCGGAAAAAG CCCAGAAGGA CCGTGCCAAG CATCTGGCGG TCAATGACCA ATCTGTTGAA
CAATCCAAGA AGTGCATCAC TTCCAACCGC AAGTCCCTGC CGGGCGTCAT GACCATTCGT
GGTTGTGCCT ACGCTGGCTC CAAGGGCGTG GTCTGGGGTC CGATCAAGGA CATGATCCAC
ATCTCGCACG GCCCTGTCGG CTGCGGTCAG TATTCCCGCG CCGGTCGCCG TAACTACTAC
GTCGGCACCA CCGGCGTCGA TACCTTCGGC ACGATGAACT TCACCTCCGA TTTCCAGGAG
AAGGACATTG TGTTCGGCGG CGACAAGAAG CTCGCCAAGC TGATCGACGA AGTCGAACTG
CTCTTCCCGC TGCACAAGGG AATCTCGGTC CAGTCCGAGT GCCCGATCGG TCTGATCGGC
GACGACATCG AGTCCGTCTC CAAGAAGGCT GCCGCCGTCA TCGACAAGCC GGTCGTCCCG
GTGCGTTGCG AAGGCTTCCG CGGTGTTTCC CAGTCCCTCG GCCACCACAT CGCCAACGAC
GCGATCCGTG ACTGGGTGCT CGACAAGCGC GACGGCGCCG CCTTCGAATC GACTCCGTAC
GACGTCGCCA TCATCGGTGA CTACAACATC GGTGGCGACG CCTGGGCCAC CCGCATCCTG
CTCGAAGAGA TGGGCCTGCG TGTTGTGGCT CAGTGGTCCG GCGACGGCAC CATTGCCGAA
ATGATGAACA CGCCGAAGGT CAAGCTGAAC CTGCTGCACT GCTACCGTTC GATGAACTAC
ATCTCGCGTC ACATGGAAGA GAAGTACGGC ATTCCTTACC TGGAATACAA CTTCTTCGGC
CCGACCAAGA TCATCGAATC CCTGCGAAAG ATCGCCGCCT TCTTCGACGA TTCCGTCAAG
GAAAAGACCG AAGCCGTGAT CGCCCGCTAC AAGCCGATGA TGGACGAGAT CATCGCCAAG
TACAAACCGC GTCTCGAGGG CAAGAAGGTC ATGCTCTACG TCGGCGGCCT GCGTCCGCGT
CACGTTATCG GCGCCTACGA AGATCTCGGC ATGGAAGTCA TCGGCACCGG CTACGAATTC
GGCCACAACG ACGACTACGA CCGCACGATC AAAGAGATGG GCGATGCCAC CCTGCTCTAC
GACGACGTCA CCGGTTTCGA GCTGGAAGAA TTCGTCAAGC GCCTGAAGCC CGACATGGTC
GGCTCCGGCA TCAAGGAAAA GTACATCTTC CAGAAGATGG GCATTCCGCT GCGCCAGATG
CACTCCTGGG ATTACTCGGG CCCGTACCAC GGTTACGACG GTTTCGCGAT CTTCGCCCGT
GACATGGACA TCGCCCTGTC CAACCCGACC TTCAAGAACC TTGTTCCGCC ATGGAAGAAG
GTTGCTGCTG AAGAAGTCAA GAAGGCCGCC TGA
 
Protein sequence
MTTLSREETE ALIAEVLEVY PEKAQKDRAK HLAVNDQSVE QSKKCITSNR KSLPGVMTIR 
GCAYAGSKGV VWGPIKDMIH ISHGPVGCGQ YSRAGRRNYY VGTTGVDTFG TMNFTSDFQE
KDIVFGGDKK LAKLIDEVEL LFPLHKGISV QSECPIGLIG DDIESVSKKA AAVIDKPVVP
VRCEGFRGVS QSLGHHIAND AIRDWVLDKR DGAAFESTPY DVAIIGDYNI GGDAWATRIL
LEEMGLRVVA QWSGDGTIAE MMNTPKVKLN LLHCYRSMNY ISRHMEEKYG IPYLEYNFFG
PTKIIESLRK IAAFFDDSVK EKTEAVIARY KPMMDEIIAK YKPRLEGKKV MLYVGGLRPR
HVIGAYEDLG MEVIGTGYEF GHNDDYDRTI KEMGDATLLY DDVTGFELEE FVKRLKPDMV
GSGIKEKYIF QKMGIPLRQM HSWDYSGPYH GYDGFAIFAR DMDIALSNPT FKNLVPPWKK
VAAEEVKKAA