Gene RPC_0478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_0478 
SymbolnusA 
ID3970240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp514898 
End bp516523 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content64% 
IMG OID637923594 
Producttranscription elongation factor NusA 
Protein accessionYP_530372 
Protein GI90422002 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.779193 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.288054 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCTG TCAGCGCCAA CAAGCTCGAA CTGCTTCAGA TTGCGGACGC GGTTGCGCGC 
GAGAAATCGA TCGATCGCTC GATCGTGATC GCCGCGATGG AAGACGCCAT CGCCAAGGCG
GCGCGGGCCC GTTACGGCTC CGAAACCGAC GTTCACGCCG AGATCGACGC CAAGAAGGGC
GAACTCAGGC TGTCCCGCCA CATGTTGGTG GTCGAGGAGG TCGAGAACTC CTCGAACCAG
ATTTCGCTGA AGGACGCCCA GCGCGCCAAC CCCGGCGCGC AGATCGGCGA CACCATCGCC
GACACCCTGC CGCCGTTGGA ATACGGCCGG ATCGCCGCGC AGTCGGCCAA GCAGGTGATC
GTGCAGAAGG TGCGCGAGGC CGAGCGCGAC CGGCAATATC AGGAATTCAA GGACCGCATC
GGCGACATCG TCAACGGCGT GGTGAAGCGC GTCGAATACG GCAGCGTGAT CGTCGACCTC
GGCCGCGGCG AGGCCATCGT GCGGCGCGAC GAGATGCTGC CGCGCGAAGT GTTCCGCAAC
GGCGACCGGG TCCGCGCCTA TATCTTCGAC GTCCGCCGCG AAACCCGCGG CCCGCAGATC
TTCCTCTCGC GCACCCATCC GCAGTTCATG GCCAAGCTGT TCGCGCAGGA AGTGCCGGAA
ATCTACGACG GCATCGTCGA GATCAAGGCG GTGGCCCGCG ATCCCGGCTC GCGCGCCAAG
ATCGGCGTGA TTTCCAGGGA TTCCTCGGTC GATCCGGTCG GCGCCTGCGT CGGCATGCGC
GGCTCGCGCG TCCAGGCGGT GGTCAACGAA CTGCAGGGCG AGAAGATCGA CATCATCCCG
TGGTCGCCGG ACATCGCCAC CTTCGTGGTC AATGCGTTGG CACCCGCCGA AGTCTCGAAA
GTGGTGATCG ACGAAGATCG TGAGCGGATC GAGGTTGTGG TCCCGGACAC CAATAACCAA
TTATCCCTTG CGATCGGTCG GCGCGGGCAA AACGTCCGGC TGGCTTCGCA GCTCACCGGA
TGGGATATCG ACATTCTGAC GGAGACCGAG GAATCGGAGC GCCGCCAGGC CGATTTCGAG
AATTCCACCC GTGTCTTCAT GGAATCGCTG AACGTCGACG AAGTGGTCGG CCAGCTGTTG
GCGTCGGAAG GCTTCACCTC GGTCGAGGAG CTGGCCATGG TCGACGTCCG CGAACTCGCT
TCCATCGAAG GCTTCGACGA CGAGACCGCG AACGAACTGC AGAGCCGGGC CCGCGAATAT
CTTGAACAGC TCGAATCCGA GCTCGAAGCC AAACGTAAGG AACTCGGTGT GGAAGACGCT
TTGAAGACGG TGCCAGGCGT GACCTCGAAG ATGCTGGTGA AGTTCGGCGA GAACGACATC
AAGACCGTCG AGGACCTGGC CGGCTGCGCC ACCGACGACC TGGTGGGCTG GAGCGAGCGC
AAGGAAGGCG GCGAGCCGGT CAAGTTCCCG GGCATTCTCG ACGCCAACGA GATTTCGCGC
GCCGACGCCG AAACGCTGAT CATGCAGGCC CGCGTCATCG CCGGCTGGAT CACCGAAGCC
GACCTCGCCA AGACTGCCGA CGCCACCGCC GACGCCGACG AGGCGTCGGA AGACCAGCCG
GTTTAG
 
Protein sequence
MAAVSANKLE LLQIADAVAR EKSIDRSIVI AAMEDAIAKA ARARYGSETD VHAEIDAKKG 
ELRLSRHMLV VEEVENSSNQ ISLKDAQRAN PGAQIGDTIA DTLPPLEYGR IAAQSAKQVI
VQKVREAERD RQYQEFKDRI GDIVNGVVKR VEYGSVIVDL GRGEAIVRRD EMLPREVFRN
GDRVRAYIFD VRRETRGPQI FLSRTHPQFM AKLFAQEVPE IYDGIVEIKA VARDPGSRAK
IGVISRDSSV DPVGACVGMR GSRVQAVVNE LQGEKIDIIP WSPDIATFVV NALAPAEVSK
VVIDEDRERI EVVVPDTNNQ LSLAIGRRGQ NVRLASQLTG WDIDILTETE ESERRQADFE
NSTRVFMESL NVDEVVGQLL ASEGFTSVEE LAMVDVRELA SIEGFDDETA NELQSRAREY
LEQLESELEA KRKELGVEDA LKTVPGVTSK MLVKFGENDI KTVEDLAGCA TDDLVGWSER
KEGGEPVKFP GILDANEISR ADAETLIMQA RVIAGWITEA DLAKTADATA DADEASEDQP
V