Gene Anae109_3046 details

Gene Information

Locus tag: Anae109_3046
Symbol:
ID: 5375824
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 3553292
End bp: 3555274
Gene Length: 1983 bp
Protein Length: 660 aa
Translation table: 11
GC content: 75%
IMG OID: 640844571
Product: putative PAS/PAC sensor protein
Protein accession: YP_001380227
Protein GI: 153005902
COG category: [T] Signal transduction mechanisms
COG ID: [COG5000] Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation
TIGRFAM ID: [TIGR00229] PAS domain S-box
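
The start and end coordinates, gene length, and protein length in this record can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(3553292, 3555274)
print(length)                  # 1983
print(protein_length(length))  # 660
```

Both values agree with the Gene Length and Protein Length fields of the record.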


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCTCACA TGCGAACGCC CGGACTCGAT CTTCGCCGCA AGATCCTGCT CGTCGCCCTG 
CTCCCCACCG CGCTGCTCGT GGCCGCCTTC CTCGTCGTGT TCGCCGTGCA GCGCAGCCGC
ATCGCCGGCC GCGTCGAGGC GAGCATGGGG CGCCTCGCCG AGGAGGGGCT CTCGCGGGCG
GCCCACGACC TGCAGACCCT CTGCGAGTCC GCTCACCGCG AGCTCTGGCT CCAGGTGCCG
CGCAGCCTGC GCGTCGCGCG CGACCAGATG GAGCGGCTCG GGCCGCTCTC CTTCTCCTCG
GAGACGGTGC GCTGGTCCGC GGTGAACCAG CTGGACCGCA GCGCGGCGGA GGTGGTCCTC
CCGAAGCTGC TCCTCGGCGG CGAATGGGCG GGCCAGAACG CCGATCCGGC CCGGCCGAGC
TTGCTCGTGG ATCGCGTGCG CGAGCTGGTC GGCGCCGAGG CCACGCTGTT CCAGCGCATG
AACGAGCGCG GCGACATGCT CCGCGTGGCG ACCACGGTGC CCGACGAGAA GGGCGCCCGC
GCGATCGGGA CCTACATCCC GGCGGTCGAT CCCGACGGCG GGGCGAACCC GGTCGTCTCG
ACCGTGCTGC GCGGCGAGAC CTACCGCGGG CGCGCGCGGG TGATCGACCG CTGGTACCTC
TCCGGCTACG AGCCGATCCG CGACGCGAGC GGACGCATCG CCGGGATGCT GTTCATCGGG
CTGCGGCAGG ACTCGCTCGA GGGCATCCGC GCCGGTGTGG CCGCCTCCCG CATCGGGCAG
ACCGGCGCCA TCCACGTGCT CGGGGCGAGC GGCAACCAGC GCGGCAAGTA CCTCATCCCG
CCGCCCGGCC ACGCGGACGG AGAGGACGCG TGGGAGGCGC GCGACGCCCG CGGCGAGGAG
TACGTCCAGC GCGTGCTGCG CGCGGCGAAG GAGGCGGAGG GCGGGACGGT CCGCCTGTCC
TACGCGCTGC GGGACGGCGC GGGGGCGCCC CGCGAGCGCG TGGCCGCGGT CACCTACTTC
GCGCCGTGGG ACTGGGTGAT CGTCGCGGAG ATGGACCGCA GCGAGGCGGT CGCCGCGCTG
CGCGAGGTGC AGTCCTCCCT CGCCGCGTCC GCCGTGACGG TGGTGGGCGT CGGCCTGCTG
CTCCTCCTCG CGAGCATCTG GGCCGCCCGC AAGGCGGCGA GCCGGCTCGC CGCGCCGCTC
GAGGCGATGG CGCTCGCCGC CGAGCGCATC GCCGAGGGAG ACGTGCAGCA GGAGGTGACC
TACCGCTCGG GCGACGAGGT CGGCCGGCTG GCGGAGGCCT TCCGCGGCAC CATCCGTTAC
ATCCAGGAGG TCGCCCGCGG CGCGGCAGCC GTCGCGCGCG GAGACCTCTC CACCCCGCTC
GTGCTCCGCT CCGACCGCGA CGAGCTGACG CGGAGCTTCC AGTCGGCGCA GTCCGAGCTG
CGGCGGCTGG TCGAGGACGC GGGCGCGCTC TCCCAGGCGG CGGTCGAGGG CCGCCTGACG
GTGCGGGCGG ATCCCTCGCG GCACCAGGGC GACTTCCGCA AGGTGGTGGA GGGCGTGAAC
GCGACCCTCT CGTCGCTCGT CGGCCACCTC GACGCCATGC CGGCGCCGGC CATGATCGTC
GGGACCGAGT TCGACATCCG CTACATGAAC CAGACGGGCG CGAGCCTCCT CGGGCGCACG
CAGCAGGAGC TCATCGGCAC CAAGTGCTAC GACAGCTTCC GCACCGGCGA CTGCAGAACC
GGGCGCTGCG CCGGCGGCCG CGCGATGGCG GAGGGCCGCG AGGTGAGCAG CGAGACCGAG
GCCCACCCCG AAGGGCTCGA TCTGGAGATC TTCTACTCGG CGGTGCCGCT CCGCGACGGC
GACGGGCGCG TGGTGGGCGC GCTCGAGGTG GTGACGGATC AGACCGCCCT CCGGCGGGGC
GCCGCGGAGG ATGGCGGAGG CCGCAGCGAG GGTCCGGAGT GCCGGCCGTG CCTCGTCCGG
TGA
 
Protein sequence
MPHMRTPGLD LRRKILLVAL LPTALLVAAF LVVFAVQRSR IAGRVEASMG RLAEEGLSRA 
AHDLQTLCES AHRELWLQVP RSLRVARDQM ERLGPLSFSS ETVRWSAVNQ LDRSAAEVVL
PKLLLGGEWA GQNADPARPS LLVDRVRELV GAEATLFQRM NERGDMLRVA TTVPDEKGAR
AIGTYIPAVD PDGGANPVVS TVLRGETYRG RARVIDRWYL SGYEPIRDAS GRIAGMLFIG
LRQDSLEGIR AGVAASRIGQ TGAIHVLGAS GNQRGKYLIP PPGHADGEDA WEARDARGEE
YVQRVLRAAK EAEGGTVRLS YALRDGAGAP RERVAAVTYF APWDWVIVAE MDRSEAVAAL
REVQSSLAAS AVTVVGVGLL LLLASIWAAR KAASRLAAPL EAMALAAERI AEGDVQQEVT
YRSGDEVGRL AEAFRGTIRY IQEVARGAAA VARGDLSTPL VLRSDRDELT RSFQSAQSEL
RRLVEDAGAL SQAAVEGRLT VRADPSRHQG DFRKVVEGVN ATLSSLVGHL DAMPAPAMIV
GTEFDIRYMN QTGASLLGRT QQELIGTKCY DSFRTGDCRT GRCAGGRAMA EGREVSSETE
AHPEGLDLEI FYSAVPLRDG DGRVVGALEV VTDQTALRRG AAEDGGGRSE GPECRPCLVR
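
The gene and protein sequences can be checked against each other by translating the gene sequence with translation table 11. The following is a minimal sketch, not the database's own pipeline. It assumes the sequence is pasted in as whitespace-separated blocks, as printed in this record, and it treats only ATG, GTG, and TTG as initiation codons, which is a simplified subset. All helper names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); the amino-acid assignments
# of table 11 are the same as those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: simplified set of initiation codons, read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon still initiates with Met
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = "GTGCCTCACA TGCGAACGCC CGGACTCGAT CTTCGCCGCA AGATCCTGCT CGTCGCCCTG"
print(translate(first_line))  # MPHMRTPGLDLRRKILLVAL
```

The output matches the first 20 residues of the protein sequence. The leading GTG codon is read as methionine because it serves as the start codon.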