Gene Rpal_1394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1394 
Symbol 
ID6409051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp1466905 
End bp1467993 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content67% 
IMG OID642711293 
Productchorismate synthase 
Protein accessionYP_001990409 
Protein GI192289804 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.187693 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGTTCA ATACCTTCGG CCATCTATTT CGCGTCACCA CCTTTGGCGA AAGCCATGGG 
GTGGCGATCG GCTGCGTGGT TGACGGCTGC CCGCCGCTGA TCCCGCTGAC CGAGGCCGAT
ATCCAGGGCG ATCTCGACCG CCGCCGGCCC GGCCAGTCGC GCTTCACCAC CCAGCGCCAG
GAAGCCGATC AGGTGAAGAT CCTGTCCGGC GTGATGGTGC ATCCCGAGAC CGGCGTGCAG
GTGACGACCG GCACCCCGAT CGCGCTGTTG ATCGAGAATA CCGACCAGCG CTCCAAGGAC
TATTCGGACA TCCAGAACAA GTATCGCCCC GGCCACGCCG ACTTCACCTA CGAGGCGAAG
TACGGCATCC GCGACTATCG CGGCGGTGGC CGCTCCTCGG CGCGCGAGAC CGCGACCCGG
GTCGCCGCAG GCGCGATCGC CCGCAAGGTG ATTGCCGGCA TGACCGTGCG CGGCGCGCTG
GTGCAGATCG GTCCGCACAA GATCGACCGT GACAAATGGG ATTGGGACGA GATCGGCAAC
AACCCGTTCT TCTGCCCGGA CAAGGACAAG GCGGCGTTCT ACGCCGACTA TCTCGACGGC
ATCCGCAAAT CCGGCTCGTC GATCGGCGCG GTGGTGGAGA TCGTGGCCGA GGGCGTGCCG
GCCGGGCTCG GTGCGCCGAT CTATGCCAAG CTCGACGGCG ACCTCGCCGC AGCGCTGATG
AGCATCAATG CGGTCAAGGG CGTCGAGATC GGCGACGGCT TCGCCAGTGC CGAACTGACC
GGCGAACAGA ACGCCGACGA GATGCGGACC GGCAATCATG GTCCGGCTTT CCTGTCGAAC
CATGCCGGCG GCATCCTGGG CGGCATTTCC ACCGGCCAGC CGGTGGTGGC GCGGTTCGCC
GTCAAGCCGA CCTCGTCGAT CCTGACCCCG CGCAAGACCG TCGATCGCAC CGGCCACGAC
ACCGAGATTC TCACCAAGGG CCGCCACGAC CCCTGCGTCG GCATCCGCGC CGTGCCGGTC
GGCGAGGCGA TGGTCGCTTG CGTGCTGGCC GACCACCTGC TGCGGCACCG GGGACAGGTC
GGCGGCTGA
 
Protein sequence
MSFNTFGHLF RVTTFGESHG VAIGCVVDGC PPLIPLTEAD IQGDLDRRRP GQSRFTTQRQ 
EADQVKILSG VMVHPETGVQ VTTGTPIALL IENTDQRSKD YSDIQNKYRP GHADFTYEAK
YGIRDYRGGG RSSARETATR VAAGAIARKV IAGMTVRGAL VQIGPHKIDR DKWDWDEIGN
NPFFCPDKDK AAFYADYLDG IRKSGSSIGA VVEIVAEGVP AGLGAPIYAK LDGDLAAALM
SINAVKGVEI GDGFASAELT GEQNADEMRT GNHGPAFLSN HAGGILGGIS TGQPVVARFA
VKPTSSILTP RKTVDRTGHD TEILTKGRHD PCVGIRAVPV GEAMVACVLA DHLLRHRGQV
GG