Gene Rpal_4721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4721 
Symbol 
ID6412407 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5081176 
End bp5082735 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content67% 
IMG OID642714600 
Producthypothetical protein 
Protein accessionYP_001993687 
Protein GI192293082 
COG category[S] Function unknown 
COG ID[COG2187] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.358451 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGCCC CAGACGCCAG CTCCCCCGGC TCCCAGCAGC AAGACGTCTT CGACTTCCTG 
GGCCGCGGGG CAGGCGACGC GCCCGTGGTG CAGATCGACA CCCATGGCGC AGCGGTTTTT
CTCGAGGGGA ACCGGGCGCT GAAGATCAAG CGCGCCGTCA AGTTCCCGTT TCTGGACTAT
TCAACCCTCG CCAAACGAAA GATCGCCTGC GAGCAGGAGC TCGAGGTCGG CCACCGATTC
GCGCCGACGA TCTATCGGCG TGTCGTTCCG ATCACCCGCA CTGACAAGGA AGCGTTACAG
ATCGGCGGCG AAGGACCAGC CGTGGAATGG GCCGTCGAGA TGATGCGCTT CGACGACAGC
GCCACGCTGG ATCATCTCGC CCGCGCCGGT TCGCTCGGTC CTGAGCTGAT CGACGCCGTC
GCCGATGCGA TCGCCGCCTC GCATCAGGCA GCGCCTTTGG CGGCGACTGC ACCATGGGTC
GCATCGATCG AACCGATCCT GGCCGACGAC ACCAACGAGC TTGCCGCAGG CGGTTTTGCC
GCCGCCGACG TCGCGGCGCT CGACAACGGC AGCCGCAATG CGCTCGGCCG GCTGCGCCCG
TTGCTGGAGC AGCGCGGTGT GGCCGGCTTC GTTCGCTGGT GCCACGGCGA TCTGCACCTC
GCCAATATCG TGGTGATCGA CGGCAAGCCG ACGCTGTTCG ACGCCATCGA ATTCGATCCG
GCGCTCGCCT CGGTCGACGT GCTGTACGAT CTCGCCTTCC CGCTGATGGA CCTGCTGCAT
TACGGCCGCG GCAGCGACTC CGCACAACTT TTGAACCGCT ATCTCGCGGT GACGAACGCG
GACAATCTCG ATGCGCTGTC GACGCTGCCG TTGCTGCTGT CGATGCGCGC TGCGATCCGC
GCCAAGGTGA TGCTGGCACG ACCCGCGGCC GATGAGACGA TCAGGCGAGC CAATCGGGCG
ATTGCCGAAT CCTATTTCGA GCTGGCACTG CGGCTGATCG CGCCGCCCCG GCCCCGGCTG
ATCGCGGTCG GCGGGCTGTC GGGCACCGGC AAGTCAGTGC TGGCTCGCGC TCTCTCCAGC
AACGTCCCGC CCCTGCCCGG TGCCGTGGTG CTGCGCTCGG ATGTGGCCCG CAAACGGCTG
CACGGCGTCG CCGACACTGA ACGGCTCCCG GCAACAGCCT ACACCACTGA AGTGACGGAG
GCGGTGTATC GCGGTCTGGC TGAGCGCGCC GCGCATATCT TGAAACAGGG ACATTCGGTG
ATCGTCGATG CGGTGTTCTC CAAGCCCGAG GAGCGCGACG CGATCGAAAG CGTCGCGGCC
GGGCTTGGCA TCCCATTCCA CGGGCTGTTT CTCACCGCCG ATCTCGCCAC GCGGGTCGCG
CGAGTCGCAG GCCGTACCGC AGATGCGTCC GATGCGACGC CGGAGATCGT CCGGCAGCAG
CAAAGCTACG CGCAAGGCGT GATCGGCTGG ACCTCGATCG ACGCCGGCGG CACTCCGGCC
GAGACGCTGT CGCGGGCGGT GGCGGCGTTG CCGCAGACCG CTCAGGTCTG CAGCACGTAG
 
Protein sequence
MPAPDASSPG SQQQDVFDFL GRGAGDAPVV QIDTHGAAVF LEGNRALKIK RAVKFPFLDY 
STLAKRKIAC EQELEVGHRF APTIYRRVVP ITRTDKEALQ IGGEGPAVEW AVEMMRFDDS
ATLDHLARAG SLGPELIDAV ADAIAASHQA APLAATAPWV ASIEPILADD TNELAAGGFA
AADVAALDNG SRNALGRLRP LLEQRGVAGF VRWCHGDLHL ANIVVIDGKP TLFDAIEFDP
ALASVDVLYD LAFPLMDLLH YGRGSDSAQL LNRYLAVTNA DNLDALSTLP LLLSMRAAIR
AKVMLARPAA DETIRRANRA IAESYFELAL RLIAPPRPRL IAVGGLSGTG KSVLARALSS
NVPPLPGAVV LRSDVARKRL HGVADTERLP ATAYTTEVTE AVYRGLAERA AHILKQGHSV
IVDAVFSKPE ERDAIESVAA GLGIPFHGLF LTADLATRVA RVAGRTADAS DATPEIVRQQ
QSYAQGVIGW TSIDAGGTPA ETLSRAVAAL PQTAQVCST