Gene Anae109_1139 details

Gene Information

Locus tag: Anae109_1139
Symbol:
ID: 5377445
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 1290733
End bp: 1292403
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 70%
IMG OID: 640842647
Product: NusA antitermination factor
Protein accession: YP_001378331
Protein GI: 153004006
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA
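
The start, end, and length values in this record can be cross-checked with inclusive-coordinate arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 1290733
end_bp = 1292403

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1671 bp) and Protein Length (556 aa).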


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.0315403
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCAGA ACGTGAACCT GAACCTCATC CTCGACCAGG TCGCCAAGGA CAAGGGCATC 
GACCGCACGC GCCTCGTCGA GATCCTCGAG GAGGCGATCG GCAGCGCCGC GAAGCGCCAC
TTCGGGATGG AGCGGAACCT GAAGGCCCGC TACGACGAGG AGAAGGGCCA GGTCGATCTC
TTCCAGGTCC TCACCATCGT CACGGACCCG ACCGAGGAGA CCCCCCTCGC CGACCCGGTG
AACATGATCC CGGTGTCGGT CGCGCACGAG AAGGGCATCG AGGTGGAGCC GGGCGACGAG
CTCGACTTCC CCATCTACTA CCGCACCGAG GACGAGGCGG AGGCGCGCGC CCAGGACGAG
CAGTGGGGCG ACCTGCTCAA GCTGAAGACC TACCGCCGCT CCTTCGGCCG CATCGCGGCG
CAGACCGCGA AGCAGGTGAT GATCCAGGGC ACCCGCAACG CCGAGCGCGA GAACGTCTTC
AACGAGTACA AGGACCGCAA GGGCGAGGTC ATCACCGGCA TCGTGCGGCG CTTCGAGCGC
GGTAACGTCA TCGTCGACCT CGGCCGCGCC GAGGCGGTGC TGCCGGTGCG CGAGCAGGTG
CCGCGGGAGA GCTACCGGGC CGGCGACCGG ATCCAGGCCT ACGTGATGGA CGTGCTGCGC
GAGTCCAAGG GGCCGCAGAT CATCCTCTCG CGCGCGTCCG TCGATCTCCT CCGGAAGCTC
TTCGAGATGG AGGTGCCGGA GATCGCCGAG GGGGTGGTGG TGATCGAGGC CGCGGCCCGC
GAGCCGGGCG GGCGGGCGAA GATCGCGGTC TCCTCGCGCG ACTCGGACGT GGATCCCGTC
GGCGCCTGCG TCGGCATGAA GGGCAGCCGG GTCCAGGCGG TCGTGCAGGA GCTCCGCGGC
GAGAAGATCG ACATCGTGCC GTGGGACGAC GACTACGCCC GCTTCGTGTG CAACGCGCTC
GCGCCGGCCG AGGTCTCCCG CGTCCTCCTC GACGAGCAGA ACAAGGCGAT GGAGATCATC
GTCCCCGACG ACCAGCTCTC GCTCGCCATC GGGCGCCGCG GCCAGAACGT GCGGCTCGCC
TCGCAGCTCA CCGGCTGGAA GCTCGACATC AACTCCGAGT CGCGCGTGAA GGAGATGCGC
GAGTTCGCGA CCGAGAGCTT CGGCGCCATC GGCATCCCCG AGGCCACGCA GGAGATGCTG
TACGCGCACG GCTTCCGCAA GGCGCAGGAC GTGGCGAACG CCGCCTCCGA GATGCTCACC
CAGTTCCCGG GCTTCACGAT GGACATGATC CCGGAGCTGC AGAAGCGCGC CCGCGAGCAG
TCGATCGTCG ACGCGGAGAA GGAGATGCGG CTCGAGCAGG AGCGCGAGGC CGCCCGCATC
GCCGAGGCGC GGCGCCACCC CGACGAGCTC ACGCAGGAGG AGCGCTTCGC GCGCGTCCGC
GGCGTCGGCG AGAAGACCAT CGAGCAGCTG AAGGTCGCCG GCTACGGCAG CGTCGAGGCC
GTCCACAACG AGTCGGACGT GATGCGGCTC GCCGAGTCGA GCGGGCTCGG CATCAAGAAG
GCCCGCCAGC TCAAGCACGC GGTGGGCGTC TACCTCGAGG AGGAGGTCAA GCTCCGCGCC
GAGCTCGACG CCGAGCGGGC GAAGGCCGCG CAGGGGGGCG CCGGCGCTTG A
 
Protein sequence
MQQNVNLNLI LDQVAKDKGI DRTRLVEILE EAIGSAAKRH FGMERNLKAR YDEEKGQVDL 
FQVLTIVTDP TEETPLADPV NMIPVSVAHE KGIEVEPGDE LDFPIYYRTE DEAEARAQDE
QWGDLLKLKT YRRSFGRIAA QTAKQVMIQG TRNAERENVF NEYKDRKGEV ITGIVRRFER
GNVIVDLGRA EAVLPVREQV PRESYRAGDR IQAYVMDVLR ESKGPQIILS RASVDLLRKL
FEMEVPEIAE GVVVIEAAAR EPGGRAKIAV SSRDSDVDPV GACVGMKGSR VQAVVQELRG
EKIDIVPWDD DYARFVCNAL APAEVSRVLL DEQNKAMEII VPDDQLSLAI GRRGQNVRLA
SQLTGWKLDI NSESRVKEMR EFATESFGAI GIPEATQEML YAHGFRKAQD VANAASEMLT
QFPGFTMDMI PELQKRAREQ SIVDAEKEMR LEQEREAARI AEARRHPDEL TQEERFARVR
GVGEKTIEQL KVAGYGSVEA VHNESDVMRL AESSGLGIKK ARQLKHAVGV YLEEEVKLRA
ELDAERAKAA QGGAGA
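
Both sequences are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below is a hedged, self-contained Python illustration of how the gene sequence could be cleaned, its GC fraction computed, and its codons translated. The helper names (`clean`, `gc_content`, `translate`) are hypothetical and not part of the source database. The codon table is written out by hand as the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation codons (table 11 differs in which start codons it permits, and that is not modeled here).

```python
# Hypothetical helpers for illustration; not part of the source database.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Codons enumerated in TCAG order, paired with the amino-acid string above.
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGCAGCAGA ACGTGAACCT GAACCTCATC CTCGACCAGG TCGCCAAGGA CAAGGGCATC"
print(translate(first_line))
print(f"GC fraction of this line: {gc_content(first_line):.2f}")
```

Translating the first line of the gene sequence this way gives 20 residues that match the first two blocks of the protein sequence as listed. Passing the full gene sequence to `gc_content` and `translate` would allow the same comparison against the reported GC content and the complete protein sequence.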