Gene Daud_0147 details

Gene Information

Locus tag: Daud_0147
Symbol:
ID: 6026896
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Desulforudis audaxviator MP104C
Kingdom: Bacteria
Replicon accession: NC_010424
Strand:
Start bp: 166688
End bp: 168160
Gene Length: 1473 bp
Protein Length: 490 aa
Translation table: 11
GC content: 64%
IMG OID: 641593003
Product: nitrogenase
Protein accession: YP_001716347
Protein GI: 169830365
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID:
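
The gene and protein lengths in the record follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(166688, 168160)
print(length, protein_length_aa(length))  # 1473 490
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields above.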


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCGGCA ACTGTAACCC CATCCCCATC GGGGGCATCA CCTACAATCA ATTGAAAATC 
GAAAAAAAGC CCCTGACGCC GGAATACGTG CCGCGCGACC CGGCCCTGGT GCCCAAGGCC
AACCGGGTGG TGGTCGTCAA CCCCGCCCGG ACCTGCATGC CCCTGGGAGC GATGTTCGCC
GGGCTGGGGA TTCACCGCGG GCTGCCCTTC GTCCAGGGGG CCCAGGGTTG CACCACCTAT
GTGCGCTACA CCTTCTGCCG CATCTTCCAG GAGCCGGCCA GCATCGCCAA CGCCTCTTTC
CACGAGGACG CGGCCGTGTT CGGCGGGCGG AAGAACTTCA CCGAAGGAAT CCGCAACCTG
GTGGTGCGCT ATAGACCCGA CCTGATTACC GTGGTCACCA CCTGCTCCAG TGAAATCATC
GGCGACGACA TGGTCAGTTT CATCAAGGTG GCCAGAAAGC GCCTGGTTTC CGAGTTGGGT
CCGGAAGACG GGAACCGGGT GCGCCTGGTG CTGGTCAGTA CCCCGAGTTT CGCCGGCTCC
CACGTTGCCG GCTACGACCG GGCGTCCCGG GCCTTCCTGG AGACCCTGGC CACCGATCAC
TCGCGGCCGA ACAATAGGGT GAACATCATC CCCGGAATGC TGATGCCCGG CGACCTGCGG
GAGATCAAGC ACCTCCTGGC GGAGATGGGG GTGGAGGCGC ACGTCCTGTT CGATATCAGC
GACGTGTTCG ACACCCCGCT GATGCCGCCC CAGACCCTGC CCTACTACCC CGAGGGGGGC
ACCCGGGTGG AGGACGTCGA GGACATGGCC AACTCCATGG CCACCTTCGC CCTGTGCCCG
AACGAGGGCG GGCTCGGGGC CCGTTACCTG GAACAGAAGT TCGGGATTCC TGCTTTCCTC
GGGCCGGTGC CCGTGGGTGT ACACAACACC GATCTCTTCC TGGAGCGCCT GTTGCGGGTT
ACCGGAAAGG AGGTGCCCAA GGCACTGCGG GACGAACGCG GCCGGCTGCT CGACTTTATG
GCCGACACCC TGCACCACAC CATGATGAAA AAGGTGGCTC TGTTTGGTGA TCCGGACCTG
GTCACCGGGC TGACCCGTTT CGTGTGCGAA CTGGGCATGG AGCCGGTGGC GGTGATGAGC
GGGACGCAGA CCAAGACGTT TGCGGCCGAC ATCGAGGGCA TTACGGGTGA GTTCGGCTTC
ACCCCGGCCG TCTTCAACGG TTCGGACCTG TTTGAATTTG AGGAAACGCT CAAGACGATG
CCCGTGGAGG TGCTGATCGG CAATTCCAAG GGCGCCGACA TCGCCAAGGA ACTCCAGATC
CCGCTGGTCC GGGCCGGGCT TTTCGTGTAC GACCGCGTCG GATACCAGAA GCGCCCGGTA
GTGGGGTACA GAGGGGGGGA ACGCTTGCTG GCCGACCTCG TGAATGCCAT TCTGGATTTC
GGTTATCCCG AGGAGCGGAC CCAGCAGCTG TAA
 
Protein sequence
MAGNCNPIPI GGITYNQLKI EKKPLTPEYV PRDPALVPKA NRVVVVNPAR TCMPLGAMFA 
GLGIHRGLPF VQGAQGCTTY VRYTFCRIFQ EPASIANASF HEDAAVFGGR KNFTEGIRNL
VVRYRPDLIT VVTTCSSEII GDDMVSFIKV ARKRLVSELG PEDGNRVRLV LVSTPSFAGS
HVAGYDRASR AFLETLATDH SRPNNRVNII PGMLMPGDLR EIKHLLAEMG VEAHVLFDIS
DVFDTPLMPP QTLPYYPEGG TRVEDVEDMA NSMATFALCP NEGGLGARYL EQKFGIPAFL
GPVPVGVHNT DLFLERLLRV TGKEVPKALR DERGRLLDFM ADTLHHTMMK KVALFGDPDL
VTGLTRFVCE LGMEPVAVMS GTQTKTFAAD IEGITGEFGF TPAVFNGSDL FEFEETLKTM
PVEVLIGNSK GADIAKELQI PLVRAGLFVY DRVGYQKRPV VGYRGGERLL ADLVNAILDF
GYPEERTQQL
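
The sequences above are printed in space-separated blocks of 10 characters. The following is a minimal sketch of how such text could be cleaned, checked for GC fraction, and translated. It assumes an unambiguous A/C/G/T sequence read in frame from its first base. The codon-to-residue mapping used here is the standard one, which translation table 11 shares for elongation codons; alternative start codons are not handled, which does not matter for this gene because it begins with ATG. All names in the code are hypothetical.

```python
BASES = "TCAG"
# Residues for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s) if s else 0.0


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]  # KeyError on ambiguous bases such as N
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
print(translate("ATGGCCGGCA ACTGTAACCC CATCCCCATC"))  # MAGNCNPIPI
```

The printed residues match the first 10 residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.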