Gene RPB_1212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1212 
Symbol 
ID3910147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1385868 
End bp1386953 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content69% 
IMG OID637883106 
Productchorismate synthase 
Protein accessionYP_484833 
Protein GI86748337 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.535517 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTTCA ATACCTTCGG CCACATGTTT CGTGTCACCA CGTTCGGCGA AAGCCACGGG 
GTGGCGATCG GCTGCGTGGT CGACGGCTGT CCGCCGCTGA TCCCGCTGAC CGAGGCCGAC
ATCCAGGGCG ACCTCGACCG CCGCCGGCCG GGGCAGTCGC GCTTCACCAC GCAGCGCCAG
GAAGCCGACC AGGTGAAGAT CCTGTCCGGC GTGATGGCGC ATCCGGAGAC CGGCGTGCAG
GTCACCACCG GGACGCCGAT CGCGCTCTTG ATCGAGAACA CCGACCAGCG CTCCAAGGAT
TATTCCGAGA TCCAGAACAA GTTTCGGCCC GGCCATGCCG ACTTCACCTA TGAGGCGAAA
TACGGCATCC GCGACTATCG CGGCGGCGGC CGCTCCTCGG CGCGCGAGAC CGCGACCCGC
GTCGCCGCCG GCGCGGTGGC GCGCAAGGTG ATCGCCGGCA TGACCGTGCG CGGCGCGCTG
GTGCAGATCG GCCCGCACCA GATCGACCGC GACAAATGGG ACTGGGCCGA GATCGGCAAC
AACCCGTTCT TCTGCCCCGA CAAGGACAAG GCGGCGTTCT TCGCCGATTA TCTCGACGGC
ATCCGCAAGA GCGGCTCGTC GATCGGCGCG GTGATCGAAG TGGTCGCCGA AGGCGTGCCC
GCGGGCCTCG GCGCGCCGAT CTACGCCAAG CTCGACACCG ACCTCGCCGC GGCGCTGATG
AGCATCAACG CGGTCAAGGG CGTCGAGATC GGCGACGGCT TCGCCACCGC GGCGCTGACC
GGCGAGGAGA ACGCTGACGA GATGCGGATG GGCAATGCCG GCCCGCAATT TCTGTCGAAC
CATGCGGGCG GCATTTTGGG CGGCATCTCC ACGGGGCAGC CGGTGGTGGC GCGGTTCGCG
GTGAAGCCGA CCTCGTCGAT CCTGTCGCCG CGCAAGACCA TCGATCGCGC CGGCCACGAC
ACCGATATCC TGACCAAGGG CCGCCACGAC CCCTGCGTCG GCATCCGCGC GGTCCCGGTC
GGCGAGGCGA TGGTCGCCTG CGTGCTGGCC GATCATCTGC TGCGCCACCG CGGGCAGGTC
GGCTAG
 
Protein sequence
MSFNTFGHMF RVTTFGESHG VAIGCVVDGC PPLIPLTEAD IQGDLDRRRP GQSRFTTQRQ 
EADQVKILSG VMAHPETGVQ VTTGTPIALL IENTDQRSKD YSEIQNKFRP GHADFTYEAK
YGIRDYRGGG RSSARETATR VAAGAVARKV IAGMTVRGAL VQIGPHQIDR DKWDWAEIGN
NPFFCPDKDK AAFFADYLDG IRKSGSSIGA VIEVVAEGVP AGLGAPIYAK LDTDLAAALM
SINAVKGVEI GDGFATAALT GEENADEMRM GNAGPQFLSN HAGGILGGIS TGQPVVARFA
VKPTSSILSP RKTIDRAGHD TDILTKGRHD PCVGIRAVPV GEAMVACVLA DHLLRHRGQV
G