Gene Anae109_0171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_0171 
Symbol 
ID5374191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp197724 
End bp199214 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content79% 
IMG OID640841683 
Producthypothetical protein 
Protein accessionYP_001377373 
Protein GI153003048 
COG category[K] Transcription 
COG ID[COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0303739 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.463043 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTCG GCGCGGGTGA GATCGAGGCG GTCGTCCGGG AGCTCGCGCC CCTCGTCGGC 
TCCCGCGTCG ACGCGGTGCG GGTCCACGCC GAGCGCGCGC TGACCCTCGA GCTGTTCGGG
CGCGCCGGGC CCGTGCTCCT CCTCCTCTCC GCCGAGCCGG ACGTCACGCG CCTCCACGTC
ACGCGCGCCC GCCCGCCGCA GCCCGCCACG CCCTTCCCCT TCCAGGTGCT GCTGCGCCGC
GAGATCGAGG GCGCGCGCCT CGCCGCGATC GAGACGCTCC CGGGGGATCG GGTGGTGGCG
CTCGCGCTCG AGACGCCGCG CGGCCGCCTG CGGCTCGTCG GGGAGCTGAC CGGCCGGCAC
GGCAACCTGT TCCTGCTGGG CGACGACGGG ATCATCCGGG CGAGCGCCGG GCGCAACCTC
TCGCAGCGGC GCAAGCTCGT CGCCGGCGAG CCGTACGTCG CGCCCGCGCC CCCGCCTTCG
CCCCGCGTCG AGCGGCCGCG CTTCACGGCC GAGCCCTCGG GCCCGTTCCC GCTCTCGGCC
GCGGTCGAGG CGCGCTACGC GGCCCTCGTG CAGGAGCGGC TCGTGGCGGA GGGCCGCCGC
CGGCTCCGCG AGCCGGTCCG CGCGGGCGTC GCCCGCGCCT CGCGCGCGCT CGAGAAGCTG
GCCGACGAGG CGGCGCGCGT GCCGGTCGCC GAGGCCGACC GGCGCGCGGC GGACCTCCTC
AAGGCGAACC TCCGCGCGGT GAAGCGCGGG GAGCGCGAGG TGACGCTCAC GGAGTGGACG
GAGGAGGGGC CGCGCGAGGT GCGCGTCGCC CTCGATCCCG CCCTCGCGCC GCAGGCCAAC
ATGGAGCGGC TCTACCGCCG CTATCGGCGC ATCGTGGAGA GCGCGGAGCG CGTGGCCGCG
CGGACCGCGG GGGTGCGGGG CCGCGAGGCG GCGCTCCGCT CGCTCCTCGG CGAGATCGAC
TCGGCGCCGC TCGAGGAGCT CCCCCGCCTC GAGCGCGAGG CGCGCCGGCT CGGGGCGGGC
CCCCGGCCGC AGCCGCAGCC GCAGCCGCGC CCCGCCGGGC CCGGCGGCCG CGCGCGGGCG
CGCCCGGAGC CGCTCCCCTA CCGCGTCTTC CGCTCCGCCG CCGGCGCCGC CATCCTCGTC
GGCCGTGGCG CCGCCGAGAA CGATCGCCTC ACGCTCCGGG TCGCGCGCGG CAACGACCTC
TGGCTGCACG CCCGCGGTGT CCCCGGCGCG CACGTGGTGG TGCGGCTCGA GAAGGGCCGC
GGCCCCGACC AGGGGACGCT CCTCGACGCC GCCCACCTCG CCCTCCACTT CTCCGACGCG
CGCGGCGCCC CGCAGGCGGA CGTCGCGTAC ACACGCGCGA AGTACGTCCG CAAGCCCAAG
GGCGCAGGCC CGGGCGCGGT GACGTACAGC CAGGAGAAGG TCCTGCTGCT CCGCACGGAG
CCGCAGCGCA TCGCGCGGCT GCTGGCGGAG GAGGAGGGCG GCCAGGAGTA G
 
Protein sequence
MSLGAGEIEA VVRELAPLVG SRVDAVRVHA ERALTLELFG RAGPVLLLLS AEPDVTRLHV 
TRARPPQPAT PFPFQVLLRR EIEGARLAAI ETLPGDRVVA LALETPRGRL RLVGELTGRH
GNLFLLGDDG IIRASAGRNL SQRRKLVAGE PYVAPAPPPS PRVERPRFTA EPSGPFPLSA
AVEARYAALV QERLVAEGRR RLREPVRAGV ARASRALEKL ADEAARVPVA EADRRAADLL
KANLRAVKRG EREVTLTEWT EEGPREVRVA LDPALAPQAN MERLYRRYRR IVESAERVAA
RTAGVRGREA ALRSLLGEID SAPLEELPRL EREARRLGAG PRPQPQPQPR PAGPGGRARA
RPEPLPYRVF RSAAGAAILV GRGAAENDRL TLRVARGNDL WLHARGVPGA HVVVRLEKGR
GPDQGTLLDA AHLALHFSDA RGAPQADVAY TRAKYVRKPK GAGPGAVTYS QEKVLLLRTE
PQRIARLLAE EEGGQE