Gene Anae109_3359 details

Gene Information

Locus tag: Anae109_3359
Symbol:
ID: 5374344
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 3925845
End bp: 3927869
Gene Length: 2025 bp
Protein Length: 674 aa
Translation table: 11
GC content: 69%
IMG OID: 640844873
Product: NifA subfamily transcriptional regulator
Protein accession: YP_001380527
Protein GI: 153006202
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains
TIGRFAM ID: [TIGR00229] PAS domain S-box
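
The coordinate and length fields of this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon. The helper names span_length and coding_codons are hypothetical and are not part of any database API.

```python
# Values copied from the record; the coordinate convention
# (1-based, both ends inclusive) is an assumption.
start_bp = 3925845
end_bp = 3927869
gene_length_bp = 2025
protein_length_aa = 674


def span_length(start, end):
    """Length of a feature whose start and end are both inclusive positions."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons in a CDS, excluding one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)      # True
print(coding_codons(gene_length_bp) == protein_length_aa)   # True
```

Under these assumptions the fields agree with one another: 3927869 - 3925845 + 1 = 2025 bp, and 2025 / 3 - 1 = 674 aa.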


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGAAG TAGCTACCCG GTATGACGAG CGCGGCATCA CCATGGACAC CCCTCCCTCT 
CTACGGACGC CCGGGCAGCA GGCGATGACG GACGAGGGCT GGCACCAGCT CTTCGAGCAC
TCGGCCATCG GGGTGACGCT CGCCGATCTC GAAGGGCACC TCGTCCACGT GAACCGCGCC
TACTGCGCGA TGCTCGGCTA CACGGAGGTC GAGCTCGAGG GGCACTCGTA CGTCTCCCAT
GCACACCCGG ATGATCGCGC GCGCCACCTG ATTCTCGTTC GCGAGCTGCT CGCCGATGCC
CGCGCCCATT TCCAGGTCGA GGAGCGGTAC GTCCGCAAGG GCGGCTCGGT GATGTGGGTC
AGCAACAGCG TGTCGCTGGT GCCGCCCCGG GGCGGCTCGC GCCGGATCCT CATCCTCGGG
CTCGTCGAGG ACATCACCGA GCGCATGCGC CTCCGTGACG AGCTGGACGC CGAGCGGAAC
CGGCTACGCC TTCTGCTCGA CGTCAACGAG CTCCTGGTCG CGCACCTCGA CCTTCGGGAG
ATGTTCCAGG CGCTCGCGTC GAGCCTGCGG AGGGTCACGG ATTGCCACTT CATCGGTCTC
GCCCTGCCCG ACGCCGCGAC GGGCGAGCTG CGGCAGCACA TCGTCGACCA TCCGGACGGG
AAGGGTGCCA TCACCGAGGG CATGGTGCTG CCTCTTCACG GCTCCGCCTC CGGCAAGGCG
TTTCGCACGG GCGCGCCCGT CTTGTTGAAC GACCCGGAGG CGAACCGCCA GGACCCGGAC
CTGTATGGCA CTCCCGAGGG AGCGCGGTTC TATCGGACCG TGCTCGAGGA AGGAGTTCCT
TCGGGATACG TCCTGCCGCT CGTTCACCGT GGCGAGGTGC TGGGCGTCCT CCAGCTCAAG
AAGTACGCGG ACGCTCGATT CAAGGAACGA GAGATCGAGT TCATGTCCAA GGTGGCGGGC
CAGCTCTCGA TCGCGGTGGC GAACGCCCTC GAGTACCGCG AGGTCAAGGA GTCGAAGGAG
CGGCTGGACA GGGAGCGGGT CTACCTGAAG GAGGAGATCC GGTCTGCGCA CGACTTCGAG
GAGATCATCG GGGTGAGCCG CACGCTGAAG CAGGTGCTCG GCCAGATCGA CACGGTCGCG
GTCACGGACT CGACCGTCCT CATCCTGGGG GAGACCGGCA CGGGCAAGGA GCTGATCGCG
CGCGCCATTC ACAACCGCAG CCGGCGGCGT GACCGTCCAT TCGTGAAGGT CAACTGCTCC
GCGATCCCCA CCGGGCTCCT CGAGAGCGAG CTCTTCGGCC ACGAGCGCGG CGCCTTCACC
GGGGCCACCG CGCCCAGGAT CGGACGCTTC GAGGCGGCCG ACCAGGGGAC GCTGTTCCTC
GACGAGATCG GGGACCTCCC CGTGGACCTG CAGCCCAAGC TGCTCCGGGT CCTCCAGGAG
CGCGAGTTCG AGCGGTTGGG CGCCAGCCGC ACGCGACGGG TCGACGTCCG GGTCGTCGCG
GCGACGAACC GGGGACTCGC CACGATGGTC GGGGAGGGCA GGTTCCGGGA GGATCTGTAC
TACCGGCTGA ATGTCTTCCC CATCACGCTT CCACCGCTGC GGGAGCGCGC CGGGGACATC
CCGCTCCTCG TGCGGCACTT CGTCGGCGTC TACGCCCGGC GGATGGGCAA GCAGATCGAC
CACATCCCCG ACGCGTCCAT GCGCGCGCTG GTCGGCTATC ACTGGCCGGG CAACGTACGC
GAGCTGCAGA ACGTGATCGA GCGGGCGGTG ATCCTCACCC CCGGTGCGGT CCTCGAGCTG
GCGCTCGCCG AAAGGGCCGC CGGCGCCCGG GAAGATCGAC CGGACGCCGC GGCACCCAAC
GGCCACCGCA CGCTGCAGGA GGTGGAGCGT GAGCACATCC TGGGCGCGCT CCAGGAGGCC
AAGTGGGTGA TCGGCGGCCC GAACGGCGCG GCCGCGCGTC TCGGCCTACG GCGCACCTCG
CTCATGTACC GGATGGAGAA GCTGGGCATC GCTCGACCGA CGTGA
 
Protein sequence
MTEVATRYDE RGITMDTPPS LRTPGQQAMT DEGWHQLFEH SAIGVTLADL EGHLVHVNRA 
YCAMLGYTEV ELEGHSYVSH AHPDDRARHL ILVRELLADA RAHFQVEERY VRKGGSVMWV
SNSVSLVPPR GGSRRILILG LVEDITERMR LRDELDAERN RLRLLLDVNE LLVAHLDLRE
MFQALASSLR RVTDCHFIGL ALPDAATGEL RQHIVDHPDG KGAITEGMVL PLHGSASGKA
FRTGAPVLLN DPEANRQDPD LYGTPEGARF YRTVLEEGVP SGYVLPLVHR GEVLGVLQLK
KYADARFKER EIEFMSKVAG QLSIAVANAL EYREVKESKE RLDRERVYLK EEIRSAHDFE
EIIGVSRTLK QVLGQIDTVA VTDSTVLILG ETGTGKELIA RAIHNRSRRR DRPFVKVNCS
AIPTGLLESE LFGHERGAFT GATAPRIGRF EAADQGTLFL DEIGDLPVDL QPKLLRVLQE
REFERLGASR TRRVDVRVVA ATNRGLATMV GEGRFREDLY YRLNVFPITL PPLRERAGDI
PLLVRHFVGV YARRMGKQID HIPDASMRAL VGYHWPGNVR ELQNVIERAV ILTPGAVLEL
ALAERAAGAR EDRPDAAAPN GHRTLQEVER EHILGALQEA KWVIGGPNGA AARLGLRRTS
LMYRMEKLGI ARPT
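
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first and last blocks of the gene. This is a minimal sketch, not the pipeline used to produce the record: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons). The function name translate and the variable names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments
# are those shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence listed above.
first_block = "ATGACTGAAG TAGCTACCCG GTATGACGAG"
last_block = "GCTCGACCGA CGTGA"

print(translate(first_block))  # MTEVATRYDE
print(translate(last_block))   # ARPT
```

Both results match the corresponding ends of the listed protein sequence, and the final codon TGA is a stop codon, which is why the 2025 bp gene yields a 674 aa product.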