Gene Daud_0146 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_0146 
Symbol 
ID6026717 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp165256 
End bp166704 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content62% 
IMG OID641593002 
Productnitrogenase component I, alpha chain 
Protein accessionYP_001716346 
Protein GI169830364 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01284] nitrogenase alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGTTTC TCAGGCTCAA GTGTGATGAA CTGATCCCCG AACGGGAAAA GCACACTTAC 
ATCACGGATA GGGAGAACCC CGTCATCCCG CTTTGCAACA TCAACACGAT CCCGGGGGAT
ATGACCGAGC GCGGGTGAGC GTTCGCGGGT GCCCGCGGGG TCGTGGGCGG ACCAATCACC
GATGCTATCC AGATCGTGCA CGCGCCGGTC GGATGTGCCT ATTACACCTG GGCGACGCGG
CGTCACCTTT CCGACCAGTA CGCCTGGAGC ATGCCCGGCC GGCTGGACAA CGTGGCCTTC
AACCGCCGCT TCTGTGTGTG CACCGACATG GAGGAAAAGG ATGTTGTCTT CGGCGGGACC
AAGAAACTCC TAAAGAGCGC GCTGGAGGCG GTAAGACTGT TTCCGGAAGC GACGGGGATC
ATCATGTACA CCACCTGCAC CACCGGTCTG ATCGGGGACG ATATCGGGTC GGTGGCCAAG
CAAATCGAAC GGGAGACCGG AAAGCCGGTG TTCTTCGCGG AGGCGCCCGG CTGTTCCGGG
GTGAGCCAGT CCAAGGGTCA TCACGTGGCG AACCGGCAGT TTTTCGAACA GATCAACGAG
ATCCGGCGCC GGCGCCCGGA ACTGATCACA CCCGAGGCGG AGCGGACGCC TTACGACGTC
GTCCTGGTCG GGGAGTACAA CATGGACTGG GACCTCAAGG TGATCCTGCC GCTGATGGAG
AGCATCGGGA TGCGGGTGGT AAGCACCTTC ACCGGAAACG CCCGGATGAT GGACCTGGTC
CGGCTGCCGG ATACCAAGCT CAACGTGGTG CACTGCCAGC GGTCGGCAAC CTACATCGCG
GATATGATCA AGGAAGGCTA CGACATTCCC TACGTCAAGG CGTCGTTCTT CGGTATCCAG
CAGACGAGCA AGGCCCTGCG GACCATTGCC CGCCATTTTG GTCTGGAGGA GCGGGCGGAG
CAGGTCATCG CCGCGGAGAC GATCCGCATC CAGCCGGCCC TGGAGTGGTA CCGCGAGCGC
CTGCAGGGCA AGACCGTGGC CGTCTACGTC GGGGGGCCCC GGGTCTGGCA CTGGATCAAG
CTCTTTGAGG AACTGGGCAT GAAGGTGGTG GCCGGAGCCT GCACCTTCGC CCACGAGGAC
GACTACGAGA AGATCAACGC CCGCGCCGGT GACGGGGTGC TGATCATCGA CAACCCGAAC
GAGTTCGAGA TCGAGGAGTT GCTGGAAACC TGTAAACCGG ATATCTTCCT CTGCGGCCTG
AAGGAGAAGT TCCTGGGCCG CAAGATGGGC GTGCCCACCC TGAACTCCCA TTCCTACGAG
AAAGGCCCCT ACGCGGGGTA CGTGGGGTTC ATCAACTTCG CCCGCGACAT CTACCAGGCG
CTTTACGCCC CGGTATGGCG CCTGACCAAC GGAAAGGAGG CCGTTACCAA TTATGGCCGG
CAACTGTAA
 
Protein sequence
MPFLRLKCDE LIPEREKHTY ITDRENPVIP LCNINTIPGD MTERGUAFAG ARGVVGGPIT 
DAIQIVHAPV GCAYYTWATR RHLSDQYAWS MPGRLDNVAF NRRFCVCTDM EEKDVVFGGT
KKLLKSALEA VRLFPEATGI IMYTTCTTGL IGDDIGSVAK QIERETGKPV FFAEAPGCSG
VSQSKGHHVA NRQFFEQINE IRRRRPELIT PEAERTPYDV VLVGEYNMDW DLKVILPLME
SIGMRVVSTF TGNARMMDLV RLPDTKLNVV HCQRSATYIA DMIKEGYDIP YVKASFFGIQ
QTSKALRTIA RHFGLEERAE QVIAAETIRI QPALEWYRER LQGKTVAVYV GGPRVWHWIK
LFEELGMKVV AGACTFAHED DYEKINARAG DGVLIIDNPN EFEIEELLET CKPDIFLCGL
KEKFLGRKMG VPTLNSHSYE KGPYAGYVGF INFARDIYQA LYAPVWRLTN GKEAVTNYGR
QL