Gene Daro_1413 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1413 
Symbol 
ID3569098 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp1541439 
End bp1543007 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content60% 
IMG OID637679881 
Productnitrogenase molybdenum-iron protein beta chain 
Protein accessionYP_284632 
Protein GI71907045 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01286] nitrogenase molybdenum-iron protein beta chain 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.079976 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAACCG CAGACAACAT CAAGCCCTGT TACCCGCTGT TCCGCGACGA CGACTACAAG 
AAGAACCTCG CTGAAAAACG CGAACTCTTC GAAGAAGGCC ACGGCCCGGA AAAGGTCTGG
GAAACCTTCG TCTGGACGAC GACCAAGGAA TATCAGGAAC TCAACTTCAA GCGCGAAGCC
CTGACCGTCG ATCCGGCCAA GGCCTGCCAG CCGCTCGGCG CCGTCCTCTG CGCCTCCGGC
TTCCACAAGA CCCTGCCCTA CGTGCATGGC TCGCAAGGTT GTGTCGCCTA TTTCCGCACC
TACTTCAACC GTCACTTCAA GGAACCGTGC TCCTGCGTGT CCGACTCGAT GACGGAAGAC
GCCGCCGTGT TCGGCGGGCA GAAGAACATG CACGACGGTC TGGCCAACGC CAAGGCCATC
TACAAGCCGG ACATGATCGC TGTCTCGACC ACCTGCATGG CCGAAGTCAT CGGTGACGAC
CTGAATGCCT TCATCAACAA CTCCAAGAAG GACGGCCACA TCCCCCAGGA TTACCCGGTC
CCCTTCGCCC ACACCCCGTC CTTCGTCGGT TCGCACACTA CCGGCTGGGA CAACATGCTC
GAAGGCATCA TCCGTTCCTT CACGCTGAAC TTCATGGACG GCAAGAAGGT CGGTTCGAAT
GGCAAGATCA ACATCGTGCC GGGCTTCGAG ACCTACCTCG GCAACTACCG TGTCATCAAG
CGCATGCTCG GTGAAATGGA TGTCGATTTC ACCTACCTGT CCGATCCTTC CGAAGTCTTC
GACACCCCGG CCGACGGCGA ATTCCGCATG TACTCCGGCG GCACGACGAT GGACGAGGTC
AAGGATGCGC CGAACGCCTA CACCACCATC CTGCTGCAGC CCACCCAGCT GGAAAAGACG
AAGAAATTCG TCGAAGCCAC CTGGAACCAG GACATCCCCA AGCTCAACAT CCCGATGGGC
GTGGAATGGA CCGACGAGTT CCTGATGAAG GTCTCCGAGA TCACCGGCAA GCCGATCCCG
GCCTCGCTGG AACTGGAACG CGGCCGCTGC ATGGACCTCG TCTCCGACAG CCACGCCTGG
CTGCACGGCA AGAAGTTTGC CGTGTATGGT GACCCGGACT TCGTGATGGG CGTCGTCAAG
ATCCTGCTCG AAGTCGGTGC CGAGCCGACC CACATCCTGG CCCACAACGG CAACAAGCGT
TGGGCCAAGA CCATGGACAA GCTCCTCGCC TCCAGCCCGT TCGGTGTCAA TGGCAAGGTG
TATGCCGGCA ACGACCTGTG GCACATGCGC AGCCTGTGCT TCACGGACAA GCCGGATTTC
CTGATCGGCA ACTCCTACGG CAAGTACATC CAGCGCGACA CCCGCCACCT GGGTGAAGAG
TTCGAAGTAC CGCTGATCCG CCTCGGCTTC CCGATCTTCG ACCGCCATCA CATGCACCGC
GCGACCACCA TGGGCTACGA AGGGATGATG TCCGTAGTGA CCCAGCTGAC CAACGCTGTC
CTTGAGCAAC TCGACAAGGA AACGATCGGC ATGGGCACCA CCGACTTCAA CTTCGACCTG
GTTCGCTAA
 
Protein sequence
MQTADNIKPC YPLFRDDDYK KNLAEKRELF EEGHGPEKVW ETFVWTTTKE YQELNFKREA 
LTVDPAKACQ PLGAVLCASG FHKTLPYVHG SQGCVAYFRT YFNRHFKEPC SCVSDSMTED
AAVFGGQKNM HDGLANAKAI YKPDMIAVST TCMAEVIGDD LNAFINNSKK DGHIPQDYPV
PFAHTPSFVG SHTTGWDNML EGIIRSFTLN FMDGKKVGSN GKINIVPGFE TYLGNYRVIK
RMLGEMDVDF TYLSDPSEVF DTPADGEFRM YSGGTTMDEV KDAPNAYTTI LLQPTQLEKT
KKFVEATWNQ DIPKLNIPMG VEWTDEFLMK VSEITGKPIP ASLELERGRC MDLVSDSHAW
LHGKKFAVYG DPDFVMGVVK ILLEVGAEPT HILAHNGNKR WAKTMDKLLA SSPFGVNGKV
YAGNDLWHMR SLCFTDKPDF LIGNSYGKYI QRDTRHLGEE FEVPLIRLGF PIFDRHHMHR
ATTMGYEGMM SVVTQLTNAV LEQLDKETIG MGTTDFNFDL VR