Gene Rpal_5031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_5031 
Symbol 
ID6412725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5412344 
End bp5413633 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content67% 
IMG OID642714916 
Producthypothetical protein 
Protein accessionYP_001993995 
Protein GI192293390 
COG category[S] Function unknown 
COG ID[COG4487] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGATC TGACCATCAC CTGCCCGAAC TGCGCCTCGT CGGTGCCACT GACGGAGTCC 
CTGGCGGCGC CGCTACTGAA GGATACGCAG GCCAAATACG AGCGGCTGAT CAAGCAGAAG
GATCAGGACA TCGCCGGGCG CGAGCAGGCG CTGCGGGCGC AGCAGGCGGA GGTCGAGAAG
GCCAAGGCGG CGGTGGCACA GCAGGTCGCC GACCAGGTGA CGGCGGCGCG GGCGCGGATC
GCCGCGGAGG AAGCCGCCAA GGCAAAGCGC CTCGCCGAAA ACGATCTCGC CGACAAGGCG
CGGCAGCTCG CCGAGCTACA GGAGGTGCTG AAAAGCCGAG ACGCCAAGCT CGCCGAGGCG
CAGCAGGCGC AGGCGGAGTT TGTGAAAAAG CAGCGGCTGC TCGAGGACGA GAAGCGCGAG
CTTCATCTGA CGATCGAGAA GCAGGTCCAA GCCGGCCTCG ATGAAGCGCG GCAGAAGGCC
CAGCAGGCCG CCGAAGATAA TCTGCGGCTC AAGGTCACCG AGAAAGAAGA GCAGATCGCC
GCGATGCAGC GGCAGATCGA GGATCTGAAG CGCAAGGCCG AGCAGGGCTC GCAGCAATTG
CAGGGCGAGG TGCTGGAGCT CGAACTCGAA GCCTCGCTGC GCGCCAAGTT TCCGCACGAC
CAGATCGAGC CGGTGCCGAA GGGCGAATTC GGCGGCGACG TGCTGCAGCG GGTGGTGAGC
GCCGCGGCGC AGCCGTGCGG CAGCATCCTG TGGGAATTCA AGCGCACCAA GAATTGGTCG
GACGGCTGGC TGACCAAGCT GCGCGACGAC CAGCGCAAGG CCAAGGCCGA GCTGGCCCTG
ATCGTCTCCA ACGCGTTGCC GAAGGGCGTG CACACCTTCG ACCATATCGA CGGCGTCTGG
GTCACCGAAG CGCGCTGCGC GATTCCGGTG GCAATCGCGC TGCGGCAGTC GCTGATCGAG
CTCGCCGCCG CGCGCCAGGC CGGCGTCGGC CAGCAGACCA AGATGGAGCT GACCTACCAG
TACCTCACCG GTCCCGCATT CCGGCAGCGG ATCGAGGCGA TCGTCGAGAA GTTCACCGAG
ATGCAGAGCG ATCTCGACAA GGAGCGTCGC TCGATGATGC GGATGTGGGC CAAGCGCGAG
GCGCAGATCC GCGGCGTGCT CGAGGCCACC GCCGGGATGT ACGGCGATCT GCAGGGCATC
GCCGGCAAAG CGCTGGCCGA GATCGACGGC ATGGCGCTGC CGATGCTGGA AGACTTCAGC
GACGACGACG GCGACAGCGA AGCGGCGTAA
 
Protein sequence
MTDLTITCPN CASSVPLTES LAAPLLKDTQ AKYERLIKQK DQDIAGREQA LRAQQAEVEK 
AKAAVAQQVA DQVTAARARI AAEEAAKAKR LAENDLADKA RQLAELQEVL KSRDAKLAEA
QQAQAEFVKK QRLLEDEKRE LHLTIEKQVQ AGLDEARQKA QQAAEDNLRL KVTEKEEQIA
AMQRQIEDLK RKAEQGSQQL QGEVLELELE ASLRAKFPHD QIEPVPKGEF GGDVLQRVVS
AAAQPCGSIL WEFKRTKNWS DGWLTKLRDD QRKAKAELAL IVSNALPKGV HTFDHIDGVW
VTEARCAIPV AIALRQSLIE LAAARQAGVG QQTKMELTYQ YLTGPAFRQR IEAIVEKFTE
MQSDLDKERR SMMRMWAKRE AQIRGVLEAT AGMYGDLQGI AGKALAEIDG MALPMLEDFS
DDDGDSEAA