Gene Daro_1501 details


Gene Information

Locus tag: Daro_1501
Symbol:
ID: 3568953
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 1617197
End bp: 1618597
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 63%
IMG OID: 637679969
Product: nitrogenase MoFe cofactor biosynthesis protein NifE
Protein accession: YP_284720
Protein GI: 71907133
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE
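
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1617197
END_BP = 1618597
GENE_LENGTH_BP = 1401
PROTEIN_LENGTH_AA = 466


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the reported values: 1618597 - 1617197 + 1 = 1401 bp, and 1401 / 3 - 1 = 466 aa.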


Plasmid Coverage information

Num covering plasmid clones: 50
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCCA GCGAAATCCA GGCCCTGCTC GACGAGCCGG CCTGTAGCCA CAACAAGAAG 
GAAAAGTCCG GCTGCGCCAA GCCCAAGCCT GGCGCCACGG CGGGCGGCTG TTCCTTCGAC
GGCGCGCAGA TCGCGCTGCT GCCGATTGCC GATGTTGCCC ATATCGTCCA TGGACCGATT
GCCTGTGCCG GTTCCTCCTG GGACAACCGC GGAACGCGCT CTTCCGGCGT CACGCTGTAC
AAGATCGGCA TGACCACCGA TCTGTCGGAA ACCGATGTGG TGATGGGCCG TGGCGAGAAG
CGGCTATTTC ATGCCATCAA GCAGGCGATC GACAGCTATT CGCCCTCGGC CGTCTTCATC
TACAACACTT GCGTCACGGC GCTGATCGGT GACGATGTCG GCGCCGTCTG CAAGGCGGCC
ACCGAACGCT GGGGGACGCC GGTTGTGCCG GTCGATGCGG CCGGTTTCTA CGGCACCAAG
AACCTCGGCA ATCGGCTGGC TGGCGAGGCG ATGTTCAAGC ATGTGATCGG TACCGCCGAG
CCTGCCCCTG CCGCACCGCG CGCCGACGGC CTGCCAACCT ACGACGTCAA TTTGATCGGC
GAATACAACA TCGCCGGTGA GTTCTGGCAT GTCGCACCGC TATTTGATGA ACTTGGCCTG
CGCATTCTTT GCACACTGTC CGGAGACTCG CGTTTCCATG AGGTGCAGAC CATGCACCGC
GCCAGGGTGA ACATGGTCGT CTGTGCCAAG GCATTGCTCA ACGTGGCACG CAAGATGGAA
GACAACTTCG GCATTCCCTT CTTCGAGGGT AGCTTCTACG GCGTGCAGGA TGTCTCCAAT
GCCTTGCGCG ATTTCGCCCG GCTGATCGGC GACCCGGATT TGACGGCGCG TACCGAGGCG
GTGATTGCCC GCGAGGAAGC CAAGTCGCAT GCCGCGCTGG AACCCTGGCG TGATCGCCTG
CGCGGCAAGC GGGTGCTGCT CTACACCGGC GGCGTCAAGT CGTGGTCCAT CGTCTCGGCC
TTGCAGGATC TGGGCATGAA GGTGGTAGCG ACCGGCACCA AGAAATCGAC CGAAGAGGAC
AAGGCGCGCA TCCGCGAGTT GATGGGTGAC GATACCAAGA TGATCGACGA CGGCAGCCCA
AAGGCCTTGC TCTCGACTTA CCACGAGTAC AAGGCCGACA TCCTGATCGC CGGTGGCCGC
AACCTCTACA CCGCCTTGAA GGCGCGCATT CCTTTCCTCG ACATCAATCA GGAACGCGAA
TTCGGCTACG CCGGCTACGA CGGCATGGTC GAACTGGCCC GCCAGCTGGC GCTATCGATG
GAAAGTCCGG TCTGGGCCGC CGTGCGCAAG CCAGCGCCGT GGGCGGCGCA AAAGGGGCCC
GGAACGGTGG TCGTGGCCTG A
 
Protein sequence
MKASEIQALL DEPACSHNKK EKSGCAKPKP GATAGGCSFD GAQIALLPIA DVAHIVHGPI 
ACAGSSWDNR GTRSSGVTLY KIGMTTDLSE TDVVMGRGEK RLFHAIKQAI DSYSPSAVFI
YNTCVTALIG DDVGAVCKAA TERWGTPVVP VDAAGFYGTK NLGNRLAGEA MFKHVIGTAE
PAPAAPRADG LPTYDVNLIG EYNIAGEFWH VAPLFDELGL RILCTLSGDS RFHEVQTMHR
ARVNMVVCAK ALLNVARKME DNFGIPFFEG SFYGVQDVSN ALRDFARLIG DPDLTARTEA
VIAREEAKSH AALEPWRDRL RGKRVLLYTG GVKSWSIVSA LQDLGMKVVA TGTKKSTEED
KARIRELMGD DTKMIDDGSP KALLSTYHEY KADILIAGGR NLYTALKARI PFLDINQERE
FGYAGYDGMV ELARQLALSM ESPVWAAVRK PAPWAAQKGP GTVVVA
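
The protein sequence can be checked against the gene sequence by translating it. The following is a minimal sketch, not the database's own tooling. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two tables differ only in permitted start codons). It assumes the input contains only A, C, G and T, apart from the whitespace that separates the 10-base groups. The function names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
# Standard codon assignments, enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases, and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence in this record.
first_row = "ATGAAAGCCA GCGAAATCCA GGCCCTGCTC GACGAGCCGG CCTGTAGCCA CAACAAGAAG"
print(translate(first_row))
```

Translating the first row yields the first 20 residues listed under the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.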