Gene Daud_0148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_0148 
Symbol 
ID6026993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp168164 
End bp169543 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content66% 
IMG OID641593004 
Productnitrogenase 
Protein accessionYP_001716348 
Protein GI169830366 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACCAAC CGAAATGCGC AAGGCGGAGC AACACCGCCG AGCAGGTTCC CTTCATTGAC 
GCCGGCCGCG CCGACCACGG CGTGATTGTG AAACCGGAAC CGCGCCTGCC GGCCTGTGAC
CGGCAGACGG TGCCCGGAGC CATGTCCCAG CGTTCCTGTG CCTACTACGG CGCCCGCTGG
TTCCTGGCTC AGCTGAAGCA GGTGCTGCAC CTGGTGCACG GTCCGGTATC GTGTGCCTAC
TACGGCGAAA CGGTGCGCAA AAAGAGGTAC ACGGTCTTTT CGACCGACCT GACCGAAAAC
GAGGTCATCT TCGGGGGCGA GGCCAAGCTC TTACGGGTCT TGCGCGAACT GGCGGTCCAC
TTTCCGGAAA ACAGGGCGAT TTTGGTGTAC GTCACCTGCG CCCCCGGCAT CATCGGGGAC
GACGTGGACC GGGTCTGCCG CCGGGCCGAG CGGGACACGG GCATGCGCTG CGTGCCGGTG
CACTGCCCGG GCTTTTTGGG ATACCACCAG GCCGCCGGCC ACGAGGCCGG AGCGCGGGTG
TTCCTCGAGC ACTTCATCGG ACACGACGCG CCGGCTGAGC CCGGGCCGCG GGACATCAAC
CTCCTGGGCG AGTTCGACGT GATGGGGGAC TACCGCGTGA TCAGGGGGAT GCTCCGGAGG
ATGGGCGTAA ACGTGGTGAA CGCCGTCACC GCCGACGCTT CGGTTGAAAG CCTCGCGCGC
GCCCACCGGG TGCAGTTGAA CCTGATCCAC TGCCGCCGCA CCGGGGGCCT CCTGGCCGAG
GAGATGGAAA GCCGGTACGG GATCGAGTAC CGCAAGGGTT CCTTTTTCGG GCTGACCGAA
ACGAGCACGA CCCTGCGCCG GCTCGGGGAT TGGCTGGACT GCCGCGCGGA GGCCGAAGCG
CTGATCGCGG AGGGAGAGGC GATGGTCGCC GGCGATTTGG AACGGTACCG GCGGGAACTC
GCGGGCAAGC GGGTGGGTCT GTTCTTCGGC GGGTCGCGGA TCGGTTCCAT GCTCAAGGGT
TATCGCGACC TGGGCCTGGA GGTGGTGGCG GCCGGGAGCC AGTTCGGGTG CGGAAACGAC
TACCGCGAGG CCTGGACCGG CCTGAACCCC GGGGCGGCCC TGGTGGACGA CACCAACGAG
GAAGAATTCG TCCAGTTCAT CAGGCAGTAC CGTCCGGACG TCATTGCCGG CGGGACGCGG
GAAAAATGGC TGGCTCACAA GTTTGGGATT CCGTTCACTG TTTTTCCGCA GGAGAGCGGC
CCCTACGCCG GGTACTCCGG TTTCTTGAAC TTCGCCCGTG ACATTCTGGC CGCCCTCAAG
GCCCCGGTGT GGCGGATCGT GAGAGGGGGA TCCAATACAC ATGCACCGTT TACCGCTTGA
 
Protein sequence
MHQPKCARRS NTAEQVPFID AGRADHGVIV KPEPRLPACD RQTVPGAMSQ RSCAYYGARW 
FLAQLKQVLH LVHGPVSCAY YGETVRKKRY TVFSTDLTEN EVIFGGEAKL LRVLRELAVH
FPENRAILVY VTCAPGIIGD DVDRVCRRAE RDTGMRCVPV HCPGFLGYHQ AAGHEAGARV
FLEHFIGHDA PAEPGPRDIN LLGEFDVMGD YRVIRGMLRR MGVNVVNAVT ADASVESLAR
AHRVQLNLIH CRRTGGLLAE EMESRYGIEY RKGSFFGLTE TSTTLRRLGD WLDCRAEAEA
LIAEGEAMVA GDLERYRREL AGKRVGLFFG GSRIGSMLKG YRDLGLEVVA AGSQFGCGND
YREAWTGLNP GAALVDDTNE EEFVQFIRQY RPDVIAGGTR EKWLAHKFGI PFTVFPQESG
PYAGYSGFLN FARDILAALK APVWRIVRGG SNTHAPFTA