Gene GSU1887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1887 
SymbolrpoN 
ID2688508 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2065151 
End bp2066596 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content55% 
IMG OID637126578 
ProductRNA polymerase sigma-54 factor 
Protein accessionNP_952936 
Protein GI39996985 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCATTG AAATGCGCCA ACAGATGAAG ATGACCCAGC AGCTGGTGAT GACGCCCCAG 
TTGCAGCAGG CCATCAAACT TCTCCAGCTG TCACGGCTGG AGCTTCAGGA TCTGGTCCGT
CAGGAGATGG AAGAGAATCC GCTTCTGGAA GAGTCTCAGG AGGCTGAAGA GGTCAAGGAA
CAGGATCTGG TCGAACTTTC CGAAAAGGAG GATGCTCCCG CTCCGGCTGA ACAGGATTTT
CGCGAGGTCA CCACGGGGGA AGAGACCCGG GAGTCGGACT GGGACAGCTA CCTTGAGGGT
TACAATTACA GTTCCGGCGA ACAGTACTAC GATGATGAAG ATCGTCCATC CTACGAAAAC
ATTCTGACAC GCAAAGGGAC CCTTGCCGAT CATCTCATGT GGCAGCTCAA TATGACCAAG
TTGAGCGATG AAGAGGCACG GGCCAGCGCG GAGATTATCG GCAACATAGA TGAAGACGGC
TACCTGCGCG CTACGGTTGA GGAAGTCGCC CTGGCATGCT CCGTAAGCGA ATCCGTGGCG
GAATCGGCTT TGAAGAAGAT CCAGGAGTTT GACCCCATGG GGGTGGGGGC TCGTAGCCTG
AGGGAATGTC TCCTGATCCA GGTGGAGCAG CTCGGCATGG CGGGGAGCGT TGTGGACGGC
ATCCTCCGCA ATCACCTCCA CGACTTGGAG ACCCGCAAGT ACAAACAAAT TGCCAAGTCT
CTCGGGGTCG ACGTGGACAG CATCCTGATG GCCGCCAAGA TTATCGCCGG CCTGGACCCG
AAACCGGGCC GAGTCTATGG CTCCGAAGAT GTCCACTACA TTTCGGCCGA TATATTTGTC
TATAAAATCG CTGACGACTA TGTCGTCGTG CTTAATGACG AAGGCCTTCC CAATCTGAGA
ATCAGCCCCT TTTATGCCGG CGAGATCAAA AACGGTGCCG CCGTCGACGC CAAGGCCGAG
GAGTACATCA ACGAAAAGTC CCGCTCCGCC ATGTGGCTCA TCAAGAGCAT CCACCAGCGG
CAGAGAACCA TCTACAAGGT TGCGAAGAGC ATTGTGAAGT TCCAGCGCGA GTTTCTCGAC
CGCGGGATAG AACACCTTCG CCCCCTGGTG CTCAGGGACG TAGCCGAGGA TATCGGCATG
CACGAGTCGA CCATCAGCCG CGTCACCACC AACAAGTACA TGCAAACCCC CCAGGGGCTC
TTCGAGATGA AGTACTTTTT CAATAGCGGC ATTTCAACTA CCGAAGGAGA TTTCATCGCT
TCAGAGAGCG TCAAGAACAA GATCAAGGAA ATCGTTGATT CCGAGGATCC CCGAAAGCCG
TACAGCGACC AGCGTATCGC CGAATTGCTT TCGGCGCACA CCATCAACAT CGCCCGACGA
ACCGTAACGA AATACCGCGA GATGTTGAAA ATCGGCTCGT CATCCGAGCG CAAACGTCAT
TTTTGA
 
Protein sequence
MAIEMRQQMK MTQQLVMTPQ LQQAIKLLQL SRLELQDLVR QEMEENPLLE ESQEAEEVKE 
QDLVELSEKE DAPAPAEQDF REVTTGEETR ESDWDSYLEG YNYSSGEQYY DDEDRPSYEN
ILTRKGTLAD HLMWQLNMTK LSDEEARASA EIIGNIDEDG YLRATVEEVA LACSVSESVA
ESALKKIQEF DPMGVGARSL RECLLIQVEQ LGMAGSVVDG ILRNHLHDLE TRKYKQIAKS
LGVDVDSILM AAKIIAGLDP KPGRVYGSED VHYISADIFV YKIADDYVVV LNDEGLPNLR
ISPFYAGEIK NGAAVDAKAE EYINEKSRSA MWLIKSIHQR QRTIYKVAKS IVKFQREFLD
RGIEHLRPLV LRDVAEDIGM HESTISRVTT NKYMQTPQGL FEMKYFFNSG ISTTEGDFIA
SESVKNKIKE IVDSEDPRKP YSDQRIAELL SAHTINIARR TVTKYREMLK IGSSSERKRH
F