Gene Anae109_4031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_4031 
Symbol 
ID5376652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp4720687 
End bp4722426 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content71% 
IMG OID640845558 
Productputative PAS/PAC sensor protein 
Protein accessionYP_001381193 
Protein GI153006868 
COG category[T] Signal transduction mechanisms 
COG ID[COG4564] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.822958 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCTTGA AGCTCAAGCT GCAGCTGGCC ATCTCGGTCG CGGTGCTGGT CACCGCGGTC 
GTCGTCGCGG GCTACGCCGT CGGCAGCCTC CGCTCGCAGG CGGCGGCGGA CGTGGCGCGC
ATCCGGGCGG AGAAGACCGC GCAGGTGAAG CAGGACCTCG CCGACAAGGT GAACACGGTC
TACGCGCTCA TCGACGCGCA GTACCGGGAG GCCGCGGACG AGCGCTACCT CGAGCAGAAG
TACGGCCAGC GGCTCGAGGC GATCCTCGAC GTGGCGGGCG CGACGCTCCG CGAGCACCTC
GAGCGCGCGC GGCGGGGCGA GGTGCCGGTC GCGCGGGCGC AGGCGGAGGC GCTCGCGACG
CTCCGCGCGA TGCGCTTCGA CGGCGGCAAG GGCTACCTCT GGATCAACAC CGTCGGCCGG
CCCTACCCCA CGATGCTGAT GCACCCGCTC GTCCCGAGGC TCGAGGGGAC GGTCCTCGAC
GCGCCCGAGT TCGACTGCGC CGGCGACGCG AACCGCAACC TGTTCCAGCT CGCCGTCGAG
CTCACGGCCG CGCGCGGCGA GGGCTTCATC CGCTACAGCT GGCCCGAGCC GGACGGCAAC
GCGCTGCTGC CGCAGATGCC CAAGTTCTCC TACGTCCGCC TGTTCAAGGA GTGGGGGTGG
GTGGTCGGGA CCGGCATCTA CGTGGACGAG GCGGTGCGAG AGAAGCTCGC GGAGATCACC
TCGGGCATCC GCAAGATCCG CTACGGCTCC GAGTACTTCT GGATCAGCAC GGCCGAGTCG
CCGGTCCCGC GGATGGTGAT GCACCCGATC CGGCCCGAGC TCGACGGCCA GCGGATGGAC
GCGAAGGAGT TCGAGCTCGT CGTGAACGGC CGCGCGCAGA ACCTGTTCTC GGCGTTCCGG
GACCTCTCGT CGAAGGACGG CGAGGGGTTC GTGGAGTACA CGTGGCCGAA GCCGACGGCG
GCGGGTGCGC CGGGGCCGGC CGCGCCGAAG GTCTCCTTCG TGAAGCGCTA CCGGCCGCTC
GACTGGATCA TCGGCACCGG CCGGTACGTC GACGACATCG AGGTCGCCAT CGCCGAGAAG
ACGGCGGCCG CCGAGGAGCA GGTGGGCGCC CTCGTCCGCC GCATCATGCT GGCCTCGCTC
GTGGTCGTCG TCCTGGCGGT CGCCGGCGTG AGCTTCCTCG CGGCGACCCT GACGAAGCCG
CTCGCCAAGC TCGTCGGCCT GTCGCGGGAC ATCGCGGAGG ACGAGAAGCA CCTGTCGCGC
CGCATCGGCC TGCGCTCGCG CGACGAGATC GGGCAGCTCG CCACCGAGTT CGATCACATG
GCCGAGCGGG TCGAGGCGAG CTTCCGGAAC GTCCGCGAGC AGCGCGAGCT CCTCCTGAGC
GTCCTGTCGA ACGTCCCGCA CTCCATCTAC TGGAAGGACC GGCGCTCCGT TTACCTGGGG
TGCAACGACC GGTTCGCGCA GCGGTTCGGC CTCCCCTCCA CGGAGGCGAT CGTCGGGAAG
ACCGACGCGC AGCTCGGCTG GAGCGACGAG CAGCGCGCGC GCCTGGCGGC GGGGGATCGG
CGCGTCATGG ACGAGGGGGC GCCGCTGCTC GACGAGCGCG AGCACCTGCG CGACGCGTCC
GGGCGGGACA TCGACTGCCT GGCGAGCAGG GTGCCGCTGC GCGACCAGGC CGGGAACCTC
ATCGGGATGC TCGGGGTGTT CGTCGCGATC CCGCCGGACG ATCGCGCGCT GAGCGCGTGA
 
Protein sequence
MTLKLKLQLA ISVAVLVTAV VVAGYAVGSL RSQAAADVAR IRAEKTAQVK QDLADKVNTV 
YALIDAQYRE AADERYLEQK YGQRLEAILD VAGATLREHL ERARRGEVPV ARAQAEALAT
LRAMRFDGGK GYLWINTVGR PYPTMLMHPL VPRLEGTVLD APEFDCAGDA NRNLFQLAVE
LTAARGEGFI RYSWPEPDGN ALLPQMPKFS YVRLFKEWGW VVGTGIYVDE AVREKLAEIT
SGIRKIRYGS EYFWISTAES PVPRMVMHPI RPELDGQRMD AKEFELVVNG RAQNLFSAFR
DLSSKDGEGF VEYTWPKPTA AGAPGPAAPK VSFVKRYRPL DWIIGTGRYV DDIEVAIAEK
TAAAEEQVGA LVRRIMLASL VVVVLAVAGV SFLAATLTKP LAKLVGLSRD IAEDEKHLSR
RIGLRSRDEI GQLATEFDHM AERVEASFRN VREQRELLLS VLSNVPHSIY WKDRRSVYLG
CNDRFAQRFG LPSTEAIVGK TDAQLGWSDE QRARLAAGDR RVMDEGAPLL DEREHLRDAS
GRDIDCLASR VPLRDQAGNL IGMLGVFVAI PPDDRALSA