Gene Rpal_4453 details

Gene Information

Locus tag: Rpal_4453
Symbol:
ID: 6412137
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4784148
End bp: 4785755
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 66%
IMG OID: 642714335
Product: hypothetical protein
Protein accession: YP_001993424
Protein GI: 192292819
COG category:
COG ID:
TIGRFAM ID:
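
The start and end coordinates, the gene length, and the protein length listed above are consistent with one another, which a little arithmetic confirms. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(4784148, 4785755)
print(length)                     # 1608
print(protein_length_aa(length))  # 535
```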


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.74365
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGATCG ACGGCGTAAG CGGGCGTACT TCCTACATCG GCACCGGGAT CCTCAATCTC 
CGCAGCCAGC TCGAGAATCT GAGCCAACAG CTCTCGAGCG GGCGGATCTC GAACACCTAT
GCCGGCGATG GCACCGGCCG CAGCCTTGCG ATCGGCCTGC GCGCGCAGAT GTCGAACATC
GCCAGCTACA GCGACACGAT GACCAACATC ACCACCCGCA TCAGCGTGGC GAACCTGTCG
CTGCAGCGGA TGGCAACGAT CAACAGCGAG GTGAAGGGCG CGGCGGTGAG CGCCGGCTCG
ACGCTGGACA ACACCGGTCA AACCCCGGGC CAGAAGACCG CGTCACTCGA CTTCTTCGAC
TCGGTCGACA TGCTGAACGC CCAGGTCGGC GACCGCTATC TGTTCGGCGG CCGGATCACC
GACACTGCGC CGGTGACCGC GGCCGACAAG ATCATGAACG GCGACGGCGC GGCGGTCGCC
GGGCTGAAGC AGGTGATCAG CGAGCGCCAC GACGCCGATG TCGGCACCAA CGGCATGGGC
CGGACCATCG TGTCGACCGG CGCGACCGCC ACGACCGTCC AGATCGGTGA GGACTTCGCC
GCCAATCCGT ACGCGACTCC GACCCCGATC GGGCCGTCGC CGTTCGGCAT GAAGCTGAAT
GCGGTTTCGA CCACGATCAG CGGCGCGGTG GTGACGCAGC CGACCGAGAC GCCGCCGACC
ACCCCGCCGG CGGCCCCCAA TCCCAAGGCG ATGTCGATCG ACCTCAACGG CGTCATTCCG
AACGAGGGCG ACGCGGTGAA GTTCACCTTC GACATGCCGG ATGGGACGCA GGAGACCATC
ACGCTGACGG CGTCTTCAAA GACGCCGCTG CCGGACGGAT GCTTCGCGAT CGATCAGGGC
TCGCCGACCG CAGTGCCCCC CGTCGCGCCG TCACCGTCGG TGACGGCGGC CAATCTGCAG
ACCGCGCTGA CGGCTGCGGT GAAGAAGATG GCCAACGGCC CGCTGGCGGC CGCCTCGGCG
ATCAAGGCCG GCGACGACTT CTTCAACAAC ACCCCGCCGC TGCGGGTGGC CGGCACCGCG
CCGTTCGGCG CCGCCACGGC GCAAGTCGCC GGCACCAAAG CCAATACGAT CTTCTGGTAC
AACGGCGAGC CGGACTCGGC GAGCGATCCG GCCCGCGGCA CCGCGGTTGC CAAGATCGAT
GACGCCATCA CGGTGCAATA CGGCGCCCGA GCCGATGAGC AGGCGCTGCG CAAGCAGTTG
CAGACCGTTG CGGTGTTCGC CGCGGTGACG ACCTCGGCCA CCGATCCCTA TAGTTCCGGA
AAGATTGCGG CGCTCAACCA GCGCGTGGCT GCTAATCTGG CGGTCGTTCC CGGCCAGCAG
TCGATCCAGA ATATGCAGGC GGAATTGGCC GGTGCTCAGG CGTCGATCAA GGCCACCGCC
AACCGCCAGA CGCAGAGTAA GGCGCTGGCG CAGACGATGC TCAGTTCGAT CGAAGGCATC
AACAATGATG AGGTGGCGAC CAAGATCCTG GCGCTGCAAA CCTCGCTGCA GGCGTCGTAT
GAGACGACGT CGAAACTCTA TCAGCTGAGC CTCGTCAAGT TTCTCTAG
 
Protein sequence
MAIDGVSGRT SYIGTGILNL RSQLENLSQQ LSSGRISNTY AGDGTGRSLA IGLRAQMSNI 
ASYSDTMTNI TTRISVANLS LQRMATINSE VKGAAVSAGS TLDNTGQTPG QKTASLDFFD
SVDMLNAQVG DRYLFGGRIT DTAPVTAADK IMNGDGAAVA GLKQVISERH DADVGTNGMG
RTIVSTGATA TTVQIGEDFA ANPYATPTPI GPSPFGMKLN AVSTTISGAV VTQPTETPPT
TPPAAPNPKA MSIDLNGVIP NEGDAVKFTF DMPDGTQETI TLTASSKTPL PDGCFAIDQG
SPTAVPPVAP SPSVTAANLQ TALTAAVKKM ANGPLAAASA IKAGDDFFNN TPPLRVAGTA
PFGAATAQVA GTKANTIFWY NGEPDSASDP ARGTAVAKID DAITVQYGAR ADEQALRKQL
QTVAVFAAVT TSATDPYSSG KIAALNQRVA ANLAVVPGQQ SIQNMQAELA GAQASIKATA
NRQTQSKALA QTMLSSIEGI NNDEVATKIL ALQTSLQASY ETTSKLYQLS LVKFL
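
The sequences above are laid out in space-separated blocks of ten characters. Below is a minimal sketch of how such a listing could be cleaned up and summarized. The helper names are hypothetical, and the GC figure is computed as the simple (G + C) / length ratio, which is an assumption about how the GC content value in the record is defined.

```python
# Stop codons of NCBI translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def looks_like_complete_cds(dna: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(dna) % 3 == 0 and dna[-3:] in STOP_CODONS


# The first two blocks of the gene sequence, used as a small example.
fragment = clean_sequence("ATGGCGATCG ACGGCGTAAG")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 60.0
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` gives a value that can be compared with the GC content listed in the record.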