Gene Rpal_0442 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_0442 
SymbolnusA 
ID6408090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp476678 
End bp478291 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content65% 
IMG OID642710354 
Producttranscription elongation factor NusA 
Protein accessionYP_001989478 
Protein GI192288873 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGTCA GCGCCAATAG GCTGGAACTG CTGCAGATCG CCGATGCGGT GGCTCGCGAG 
AAAACCATCG ACCGCAGCAT CGTGATTGCG GCGATGGAAG ATGCGATCGC CAAGGCGGCG
CGCGCCCGCT ACGGCTCGGA GACCGACGTC CATGCTGAGA TCGACCCGAA GAAGGGCGAG
CTGCGGCTGT CGCGCCACAT GCTGGTGGTC GAGCAGGTCG AAAACCCCGC CAACCAGATT
TCGCTGAAGG ACGCGCAGCG CGCCAATCCC GGCGCGCAGA TCGGCGACAC CATCGCCGAC
ACCCTGCCGC CGCTGGAATA CGGCCGTATC GCCGCGCAGT CGGCCAAGCA GGTGATCGTG
CAGAAGGTCC GTGAGGCCGA GCGCGACCGC CAGTACATGG AATTCAAGGA CCGGATCGGC
GACATCGTCA ACGGCGTCGT CAAGCGCGTC GAATACGGCA GCGTGATCGT CGACCTTGGC
CGCGGCGAAG CGATCATCCG CCGCGACGAG ATGCTGCCGC GTGAGTCGTT CCGCAACGGC
GACCGCGTCC GCGCCTATGT GTTCGACGTC CGCCGCGAGA CCCGCGGCCC GCAGATCTTC
CTGTCGCGCA CCCATCCGCA GTTCATGGCC AAGCTGTTCG CGCAGGAAGT GCCGGAAATC
TACGACGGCA TCGTCGAGAT CAAGGCGGTC GCCCGCGATC CCGGCTCGCG CGCCAAGATC
GGCGTCGTCT CGCGGGACTC CTCGGTCGAT CCGGTCGGCG CCTGCGTCGG TATGCGCGGT
TCGCGCGTGC AGGCGGTGGT GAACGAACTG CAGGGCGAGA AGATCGACAT CATTCCGTGG
TCGCCGGACA TCGCCACCTT CGTGGTCAAC GCGCTGGCGC CGGCCGAAGT CTCGAAGGTC
GTGATCGACG AAGATCGCGA ACGCATCGAG GTTGTGGTTC CGGATACCAA TAACCAACTA
TCCCTGGCGA TTGGTCGCCG CGGTCAGAAC GTGCGGCTCG CTTCGCAGCT CACCGGCTGG
GACATCGACA TCCTGACCGA GAGCGAGGAA TCCGAGCGCC GCCAAGCCGA CTTCGAGAAG
ACCACCCGGG CCTTCATGGA CGCGCTGAAC GTCGACGAGG TCGTCGGCCA GCTGCTCGCC
TCCGAAGGTT TCACCTCGGT CGAAGAACTG GCGCTGGTCG ACCCGCGCGA ACTCGCCTCG
ATCGAAGGTT TCGACGAGGA AACCGCCGCC GAACTGCAGA CCCGCGCCAG CGAATATCTC
GACCGGATTG AATCCGAGCT CGAGGCCCGG CGCCTGGAGC TCGGCGTCGA AGATGCTCTG
AAGGACGTTC CCGGCGTCAC CTCGAAGATG CTGGTCAAGT TCGGCGAGAA CGACGTCAAG
ACCGTCGAGG ATCTGGCCGG CTGCGCCACC GACGATCTGG TGGGTTGGAC CGAGCGCAAG
GACGGCGCCG AGCCGGTGAA GTATCCTGGC ATTCTCGACG GCATGGAGAT GTCGCGCGAG
GACGCCGAAC ACCTGATCAT GCAGGCCCGC GTCAAGGCCG GCTGGATCGA CGAGTCGGAG
CTCGCCTCCG AAGAAGAACC CGCGGACGAA GCGTCCGACG AGTCGGCGGA CTGA
 
Protein sequence
MAVSANRLEL LQIADAVARE KTIDRSIVIA AMEDAIAKAA RARYGSETDV HAEIDPKKGE 
LRLSRHMLVV EQVENPANQI SLKDAQRANP GAQIGDTIAD TLPPLEYGRI AAQSAKQVIV
QKVREAERDR QYMEFKDRIG DIVNGVVKRV EYGSVIVDLG RGEAIIRRDE MLPRESFRNG
DRVRAYVFDV RRETRGPQIF LSRTHPQFMA KLFAQEVPEI YDGIVEIKAV ARDPGSRAKI
GVVSRDSSVD PVGACVGMRG SRVQAVVNEL QGEKIDIIPW SPDIATFVVN ALAPAEVSKV
VIDEDRERIE VVVPDTNNQL SLAIGRRGQN VRLASQLTGW DIDILTESEE SERRQADFEK
TTRAFMDALN VDEVVGQLLA SEGFTSVEEL ALVDPRELAS IEGFDEETAA ELQTRASEYL
DRIESELEAR RLELGVEDAL KDVPGVTSKM LVKFGENDVK TVEDLAGCAT DDLVGWTERK
DGAEPVKYPG ILDGMEMSRE DAEHLIMQAR VKAGWIDESE LASEEEPADE ASDESAD