Gene P9211_16971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_16971 
Symbol 
ID5730045 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1523422 
End bp1524942 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content40% 
IMG OID641286079 
Productanthranilate synthase component I/chorismate-binding protein 
Protein accessionYP_001551582 
Protein GI159904238 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.623423 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATATAT CAGATCAGGA TTCATTTTTA GAAGCAGCTT CTAGAGGATT AACTTTTATT 
CCACTGGTTC ACAGTTGGCC TGCAGACCTT GAGACACCAT TATCAACATG GTTGAAGGTT
GGTGAAGGCC ATCCTCCAGG GGTTCTTCTT GAATCTGTAG AGGGAGGCGA AACTCTTGGG
AGATGGAGTG TAGTTGCTAC CGATCCTCTT TGGATAGCAA CTGCTAGAGG GAATAGTCTC
AAAAGGGAGT GGCGTGACGG ACAATGCGAC GAAATACAAG GCAATCCTTT TGAAGTTATT
AGGGAATGGC TTCTTCCATA TCGTACTGAG CCTATTGACG GCTTACCTTG CATAGGTCAA
TTGTATGGAA TATGGGGTTA TGAATTGATT CAATGGGTAG AGCCAAAAGT TTCAGTGTTT
GCAAGGACTA AAAGTGATCC TCCTGATGGA GTGTGGATGT TTATGGATAG AGTTTTAATT
TTTGATCAGG TTAAAAGAGT GATTAATGCA GTCTCGTATG GAGATTTGAC ATGCAATGAT
CAACCACTTC AGGCATATGA AAAGGCTGCA CAAAGAAGCA AAGATTTGCA AGTTCTTTTG
CAATCCCCAC TTCCTTCGCT GAAACCTCTT CAATGGCAAT CAACAACTGA GACTCCAAAC
TCAGTAAAGA GTAATACCAC TCAAGTCAAA TTTAAGAATG CAGTTAAATC AGCTAAGGAA
TACATAAAAA AAGGAGATAT TTTTCAAATT GTCCTTAGTC AGAAATTAAG AACTCAGGTT
CCTAATAAAC CTTTTGAGAT TTATCGAAGT TTGCGCATGG TGAATCCTTC GCCGTTTATG
GCTTTTTTTG ATTTTGGTGA TTGGCAACTT ATTGGATCTA GTCCTGAAGT GATGGTTCAA
GCAAAGCCCA GTGAAAAAGG TATCTATGCA AGCTTGAGAC CTATTGCAGG TACAAGACCT
AGAGGTATCA ATGAAATGGA AGATAAAACA TTAGAACGCG AATTATTATC TGATCCAAAA
GAAATAGCAG AGCATGTAAT GCTAGTGGAT TTAGGACGTA ATGATTTGGG CAGGGTTTGT
CGATCTGGGA CTGTTGAAGT TAAAGAGTTG ATGGTGATTG AAAAGTATTC TCATGTGATG
CATATTGTTA GTGAAGTAGA AGGAATGCTT AGAGAAGATA AGGATGTATG GGATCTACTA
ATGGCAGCTT TCCCTGCTGG CACGGTTTCT GGAGCACCTA AGATAAGAGC AATGCAGTTA
ATCAATGAAT TAGAAACTCA GCCTCGAGGA CCATATTCAG GGGTATATGG ATCAATGGAT
TTAAATGGAG CATTAAATAC AGCAATTACC ATTAGAACTA TGGTTGTATC CTCTCATTCA
AACAATATTT CAAATGTGCA AGTTCAAGCA GGTGCAGGTG TAGTTGCTGA CTCAATCCCT
GCAAATGAAT TCCAAGAAAC TATGAATAAA GCTAAAGGCT TGCTCACTGC ACTAGGATGT
CTTGAGCGGT CTGATTCATG A
 
Protein sequence
MHISDQDSFL EAASRGLTFI PLVHSWPADL ETPLSTWLKV GEGHPPGVLL ESVEGGETLG 
RWSVVATDPL WIATARGNSL KREWRDGQCD EIQGNPFEVI REWLLPYRTE PIDGLPCIGQ
LYGIWGYELI QWVEPKVSVF ARTKSDPPDG VWMFMDRVLI FDQVKRVINA VSYGDLTCND
QPLQAYEKAA QRSKDLQVLL QSPLPSLKPL QWQSTTETPN SVKSNTTQVK FKNAVKSAKE
YIKKGDIFQI VLSQKLRTQV PNKPFEIYRS LRMVNPSPFM AFFDFGDWQL IGSSPEVMVQ
AKPSEKGIYA SLRPIAGTRP RGINEMEDKT LERELLSDPK EIAEHVMLVD LGRNDLGRVC
RSGTVEVKEL MVIEKYSHVM HIVSEVEGML REDKDVWDLL MAAFPAGTVS GAPKIRAMQL
INELETQPRG PYSGVYGSMD LNGALNTAIT IRTMVVSSHS NNISNVQVQA GAGVVADSIP
ANEFQETMNK AKGLLTALGC LERSDS