Gene P9303_29681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_29681 
Symbol 
ID4778490 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2621771 
End bp2623186 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content55% 
IMG OID640088492 
Productputative p-aminobenzoate synthetase 
Protein accessionYP_001018963 
Protein GI124024656 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACAGC TGCCGCATCC GCGATGCAGC TTTGAGCGAG GCTGGCAGAT CAGATCTCTG 
CAGCAAGCCG GCACCATGAC TTCTCTGGAG CGGCGCCTTT GCCGCTGGCA AGAACCAGCC
ATGTTGGCAA AGCAACTTAC CAACACTTGG GGAGAAGCTG GGCTTATCTG GCTGGATGGA
GACGGCAGCG ATCTGGGTCG TTGGGCGACT CTGGCAGTCG ATCCCATTAA TCAGATTTGT
TGCCGAGGCA TCCCTGGCGA AAAAGGCGCC AGTAACCCTT TTGCAGCACT GCGTGATCTA
GAACCAGGCC ATTGGACAGG CTGGCTCAGC TACGAAGCAG CAGCCTGGAT AGAACCCAAA
AATCCCTGGA AAGCAGATTC CATGGCGACT CTGTGGATGG CGCGTCACGA TCCCATCCTG
CGATTCGACC TTCAGAAACG CCAACTCTGG ATCGAAGGGT GTCATCCCAA ACGACTGCAA
GAACTGGCTA ACTGGCTAGA AGCCAACCCA TCTGAAGATG CACCTAAAAG CAGCAAAGAT
GAGCCGTTAC TTGCAACAAC AGACCTCAAA ATTCCAGTCA ATGCATGGGA ATGGCTCACC
ACTAGAGCTG ACTATGCACG TGATGTGCAA CAGATCAGAC ATTGGATTGC CAGCGGCGAT
ATTTTCCAGG CCAATCTCAG CGCTTGCTGT ACCACCACGA TCCCTTCAGG AAGCTTCGCC
GTAGACCTTT TCCTAAAACT GCGTCACCAC AACCCAGCTC CCTTTGCCGG CCTAGTGATA
GCTGCAGGCC TTGCAAAAGG TGAAGCCGTG ATCTCCGCCT CTCCAGAGCG TTTCCTAAAA
GCACTCCCCA CAGGAGAAGT TGAAACACGA CCAATCAAAG GAACCCGACC ACGCCACCCC
AACCAAAGCC AAGACGCTGA TCTGGCAGCT GATCTTGTTT GCAGCAGCAA AGACCGAGCT
GAAAACGTGA TGATCGTCGA TCTATTGCGC AACGATCTAG GGCGAGTCTG TCAACCCGGC
TCCATCACAG TGCCTCAATT GGTAGGCCTT GAGAGCTATC CCCATGTGCA TCACCTCACC
TCAGTAGTGC AAGGACGGCT TCGATCAGAC CAATCTTGGG TAGACCTACT ACAAGCCTGT
TGGCCGGGAG GCTCCATCAG CGGTGCCCCC AAGCTGCGCG CCTGTCAGCG GCTGAACGAA
CTGGAACCAA CAGCACGCGG GCCCTATTGC GGCTCGTTGT TGCATCTCAA CTGGGATGGG
CAGCTCGACA GCAGCATTCT GATTCGCTCG ATGTTGCTCG AGGGCAACAC CTTACGAGCC
CATGCCGGCT GCGGCATCGT TACCGGTTCC GACCCCTATT GCGAAGCTGA TGAGCTGAAC
TGGAAACTGC TGCCACTACT GGAGGCACTG CAATGA
 
Protein sequence
MRQLPHPRCS FERGWQIRSL QQAGTMTSLE RRLCRWQEPA MLAKQLTNTW GEAGLIWLDG 
DGSDLGRWAT LAVDPINQIC CRGIPGEKGA SNPFAALRDL EPGHWTGWLS YEAAAWIEPK
NPWKADSMAT LWMARHDPIL RFDLQKRQLW IEGCHPKRLQ ELANWLEANP SEDAPKSSKD
EPLLATTDLK IPVNAWEWLT TRADYARDVQ QIRHWIASGD IFQANLSACC TTTIPSGSFA
VDLFLKLRHH NPAPFAGLVI AAGLAKGEAV ISASPERFLK ALPTGEVETR PIKGTRPRHP
NQSQDADLAA DLVCSSKDRA ENVMIVDLLR NDLGRVCQPG SITVPQLVGL ESYPHVHHLT
SVVQGRLRSD QSWVDLLQAC WPGGSISGAP KLRACQRLNE LEPTARGPYC GSLLHLNWDG
QLDSSILIRS MLLEGNTLRA HAGCGIVTGS DPYCEADELN WKLLPLLEAL Q