Gene GSU3108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU3108 
Symbolrho 
ID2688442 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3410940 
End bp3412187 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content59% 
IMG OID637127801 
Producttranscription termination factor Rho 
Protein accessionNP_954149 
Protein GI39998198 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCTGC AGGAGCTTAA AGGGAAAAAA ATCAACGAAC TGGCTGCCAT TGCCAAGGGG 
TTGAACATTG AGGGCGCGTC CAGTCTGCGC AAGCAGGACC TGATCTTCGC CATTCTCAAC
GCCCAGACCG AAAAGAACGG CATGATCTTC GGCGAGGGGG TTCTGGAGTG CCTTCCCGAC
GGCTTCGGCT TTCTGAGGGC GCCGGATTAC AACTACCTGC CGGGCCCCGA CGACATCTAC
GTATCGCCGT CGCAGATCCG CCGCTTCAAC CTCCATACGG GCGATACCGT TTCGGGGCAG
ATCCGTCCTC CCAAGGAGGG TGAGCGCTAT TTCGCTCTTC TCAAGGTGGA GACGGTCAAC
TTCGAGCCCC CTGAGGTCGC CCGCGACAAG ATTCTGTTCG ACAACCTGAC TCCTCTCTAT
CCCGACGAAA AGCTCAAACT CGAAACCGCG CCGGACAATA TGTCGATGCG CGTCATGGAG
CTTGTCTCCC CTATCGGCAA GGGGCAACGG GGGCTCATCG TGGCGCCGCC GCGTACCGGC
AAGACGATGC TCATTCAGAA CATCGCCAAT TCCATTGCCG AAAATCACCC CGAAGTCTAT
CTGATCGTGC TGCTCATCGA CGAACGGCCC GAAGAGGTGA CCGACATGCA GCGCTCGGTC
AAGGGGGAAG TAGTCTCCTC CACCTTTGAC GAACCGGCCA CCCGCCACGT GCAGGTGGCT
GAGATGGTCA TCGAAAAGGC CAAGCGCCTG GTGGAGCACA AGCGCGACGT GGTGATTCTC
CTCGACTCCA TAACCCGTCT TGCCCGGGCT TACAATACGG TGCTTCCTCC TTCCGGCAAA
ATCCTCACCG GCGGGGTTGA CGCCAACGCC CTCCAGAAGC CCAAGCGCTT TTTCGGCGCT
GCCCGCAATA TCGAAGAGGG GGGGTCTCTG ACCATCATCG CGTCGGCCCT GGTGGATACC
GGCAGCAAGA TGGATGAGGT TATCTTCGAA GAGTTCAAGG GGACCGGAAA CATGGAGGTT
CACCTGGACC GCAAACTGGT GGAGAAGCGG ACCTTCCCGG CCATCGACAT CAACAAGTCC
GGCACCCGCA AGGAAGAGCT GCTGGTGGAG AAGAGCGCGC TCAACCGCAT CTGGATTCTG
CGCAAGGTCC TCCATCCCAT GAACGTGGTC GACAGCATGG AGTTCCTCTT GGAGAAGCTC
TCCGAGACCA AGGACAATCA GGCGTTTCTC GACTCCATGA GCAGGTAG
 
Protein sequence
MNLQELKGKK INELAAIAKG LNIEGASSLR KQDLIFAILN AQTEKNGMIF GEGVLECLPD 
GFGFLRAPDY NYLPGPDDIY VSPSQIRRFN LHTGDTVSGQ IRPPKEGERY FALLKVETVN
FEPPEVARDK ILFDNLTPLY PDEKLKLETA PDNMSMRVME LVSPIGKGQR GLIVAPPRTG
KTMLIQNIAN SIAENHPEVY LIVLLIDERP EEVTDMQRSV KGEVVSSTFD EPATRHVQVA
EMVIEKAKRL VEHKRDVVIL LDSITRLARA YNTVLPPSGK ILTGGVDANA LQKPKRFFGA
ARNIEEGGSL TIIASALVDT GSKMDEVIFE EFKGTGNMEV HLDRKLVEKR TFPAIDINKS
GTRKEELLVE KSALNRIWIL RKVLHPMNVV DSMEFLLEKL SETKDNQAFL DSMSR