Gene P9303_22721 details

Gene Information

Locus tag: P9303_22721
Symbol:
ID: 4778652
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 2006956
End bp: 2008476
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 54%
IMG OID: 640087790
Product: anthranilate synthase component I/chorismate-binding protein
Protein accession: YP_001018272
Protein GI: 124023965
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID:
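
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a gene length that includes one 3 bp stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2006956
end_bp = 2008476
gene_length_bp = 1521
protein_length_aa = 506

# Assumption: coordinates are 1-based and inclusive at both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS includes a single terminal stop codon,
# which is not counted in the protein length.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print(span_bp, coding_codons)  # 1521 506
```

Under these assumptions the span equals the listed gene length, and the number of coding codons equals the listed protein length.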


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.267297
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCAGCT CTGATCGTGA CCATTTCTTT GAGATGGCTG CTAGTGGTGC CAATTTCATT 
CCTTTGGCCC ACAGTTGGCC AGCGGATCTG GAGACCCCTC TCACAACTTG GTTAAAAGTT
GGGGCAGACC ATCCCCCTGG GGTTTTACTT GAATCGGTCG AAGGGGGTGA AACTCTTGGG
CGCTGGAGTG TGGTTGCCTG CAATCCACTT TGGACTGCCA CATGCCGAGG GAAACACCTC
ACACGTCGTT GGCGAGAAGG ACGAACAGAT GAAGCCATCG GCAACCCTTT TGAAGGCCTC
AGGCAATGGC TAGCTCCTTA TCGCACCGCA ACCCTTCCAG GCCTACCCCC CCTTGGTCAG
CTCTATGGAA TGTGGGGTTT TGAACTGATC AAGTGGGTTG AACCCACAGT GCCCGTTCAC
TTAAGGGACA ACAACGATCC GCCTGATGGC ATCTGGATGC TGATGGACAG CATCTTGATC
ATTGATCAAG TCAAACGCCT CATCACTGCC GTTGCATACG CAGACCTGAG TGGCGAGCAA
ACGGCTAACG AAGCTTGGGA CAAGGCACAA GCACGCATTC AAGACCTAGA AAAGTGCATG
GCGGAACCAC TTGCACCGAT TCAGCCACTG AAATGGCAAC CAAAAGGTCA ATCTCCACCT
TCCACCATCA GTAACTACAG CCAAAAAGGC TTTGAGGAGG CAGTTCAAAC GGCCAAGCAA
CACATCGCCG CAGGGGATGT GTTCCAGCTT GTGATCAGTC AAAGGCTGGA GACCAGAGTT
CCTCAACAGC CACTTGAGCT CTACCGAAGT CTGCGGATGG TGAATCCTTC TCCATATATG
GCTTTCTTTG ACTTCGGCGA CTGGCAGCTG ATTGGCTCAA GCCCGGAGGT CATGGTCAAG
GCGGAGCCAG TCGTCGATGG CATTAAGGCC AGCCTTCGGC CTATTGCCGG CACGCGTCCG
CGTGGCGGCA ACGAACTTGA GGACCGCAAT CTTGAAGCAG AGTTGATGGC AGATCCCAAG
GAACGTGCCG AGCATGTGAT GTTGGTTGAT CTTGGCCGCA ATGACCTTGG ACGCGTTTGC
AGGCCGGGCA GTGTGACGGT GAAAGAGCTG ATGGTGATCG AGAAATATTC CCACGTCATG
CACATCGTCA GTGCAGTGGA AGGTGTGCTT GCCAAAGGCA AGGATGTTTG GGATCTACTC
ATGGCCTCAT TCCCAGCAGG CACGGTCAGT GGCGCCCCAA AAATCAGAGC CATGCAGCTC
ATTCATGACC TCGAACCCGA CTCACGAGGA CCTTATTCAG GTGTTTATGG GTCCATCGAT
CTCAATGGTG CCCTGAATAC AGCTATAACC ATTCGCACAA TGATTGTGCG GCCCCATCCT
GAAGGCGGCT GGCAAGTCAA GGTTCAAGCA GGCGCTGGTG TGGTGGCCGA TTCCATCCCC
ACCAAGGAAT ACGAAGAGAC CCTCAACAAG GCAAGGGGAA TGCTCACAGC CCTGGCCTGC
CTCGAGTCCC ACAAGTCATG A
 
Protein sequence
MLSSDRDHFF EMAASGANFI PLAHSWPADL ETPLTTWLKV GADHPPGVLL ESVEGGETLG 
RWSVVACNPL WTATCRGKHL TRRWREGRTD EAIGNPFEGL RQWLAPYRTA TLPGLPPLGQ
LYGMWGFELI KWVEPTVPVH LRDNNDPPDG IWMLMDSILI IDQVKRLITA VAYADLSGEQ
TANEAWDKAQ ARIQDLEKCM AEPLAPIQPL KWQPKGQSPP STISNYSQKG FEEAVQTAKQ
HIAAGDVFQL VISQRLETRV PQQPLELYRS LRMVNPSPYM AFFDFGDWQL IGSSPEVMVK
AEPVVDGIKA SLRPIAGTRP RGGNELEDRN LEAELMADPK ERAEHVMLVD LGRNDLGRVC
RPGSVTVKEL MVIEKYSHVM HIVSAVEGVL AKGKDVWDLL MASFPAGTVS GAPKIRAMQL
IHDLEPDSRG PYSGVYGSID LNGALNTAIT IRTMIVRPHP EGGWQVKVQA GAGVVADSIP
TKEYEETLNK ARGMLTALAC LESHKS
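
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The sketch below is one possible way to do that in Python and to translate the result. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built on the assumption that the amino acid assignments of translation table 11 match the standard genetic code.

```python
# Assumption: codons are enumerated in the conventional T, C, A, G base order,
# and translation table 11 uses the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a cleaned DNA sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from position 0; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# The first three blocks and the final line of the gene sequence above.
first_blocks = clean_sequence("ATGCTCAGCT CTGATCGTGA CCATTTCTTT")
last_line = clean_sequence("CTCGAGTCCC ACAAGTCATG A")

print(translate(first_blocks))  # MLSSDRDHFF
print(translate(last_line))     # LESHKS*
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.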