Gene P9301_17681 details


Gene Information

Locus tag: P9301_17681
Symbol:
ID: 4911892
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9301
Kingdom: Bacteria
Replicon accession: NC_009091
Strand:
Start bp: 1491404
End bp: 1492924
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 34%
IMG OID: 640161369
Product: anthranilate synthase component I/chorismate-binding protein
Protein accession: YP_001091992
Protein GI: 126697106
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID:
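
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, although it is consistent with the listed gene length) and that the final codon of the CDS is a stop codon that contributes no amino acid. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1491404
end_bp = 1492924
protein_length_aa = 506

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; the last codon is assumed to be
# the stop codon, which adds no residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp, remainder, expected_protein_length)  # 1521 0 506
```

The computed values agree with the Gene Length (1521 bp) and Protein Length (506 aa) fields.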


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCAGCT CACAGAAAAT TAGTTTTTTA AAGGCTTATA AAGAAGGTAA AAATTTTATA 
CCTATAGTAG AAACTTGGCC AGCTGATTTG GAGACTCCAT TATCAACTTG GTTAAAATTA
TCTTCAAAAG ATTCCCATGG TGTTTTTCTT GAATCTGTTG AAGGTGGGGA GAATTTGGGC
AGGTGGAGTA TTGTTGCTAC TAAACCTCTT TGGGAGGCAG TTTGTTATGG AGAAGAAATA
GTTAAAACTT GGAATAATGG CAAAACTGAA ACACATAAAG GTGATCCTTT TGATATTTTA
AAAAGTTGGA CAATGGAATA TAAGTCAACC ATGCTTGAAG AATTACCATC AATTGGACAG
TTATATGGCT CTTGGGGTTA TGAATTAATA AATCGAATAG AGCCAAGCGT TCCAATAAAT
GAAATGCCAG AAAACAATAT CCCTCAAGGT TCCTGGATGT TTTTTGATCA ATTAGTTGTT
TTTGATCAAA TAAAAAGATG TATTACTGCG GTGGTTTATG CAGATACAAC TTCTTCTCAA
GAGTCTTCAA TTGAAGAGTT GTATCTAAAC TCAATATCTA AAATTCAGGA AACTAGAAAT
TTAATGAGAG TTCCTCTAAA AGAAAATGAG TTTTTAGATT GGAATGAAAA TGATAATTTA
AATTTAGATT TAGAAAGTAA TTGGGAGAAA AAAGATTTTG AGGATGCAGT TCTCTCTGCA
AAAGAATACA TAAGAAAGGG AGATATCTTC CAAATAGTTA TAAGTCAAAG ATTTCAAACT
CAAGTCAATA ATGATCCCTT TAACTTATAT AGAAGTCTGA GGATGGTTAA TCCATCTCCT
TACATGTCAT TTTTTGATTT CGGCTCATGG TATCTGATAG GTTCAAGTCC TGAAGTAATG
GTTAAAGCAG AAAAAAATAA AAATAGTCAG ATTGTTGCAA GCTTAAGACC AATAGCTGGC
ACGAGACCAA GAGGTATTGA TAATCAACAA GACTTGGAAT TAGAAAAGGA ATTATTAAAG
GATCCCAAAG AGATAGCTGA GCATGTAATG TTGATTGATC TTGGAAGAAA TGATCTTGGA
AGAGTTTGTG AAATTGGTAC TGTCAAGGTA AAGGATTTAA TGATTATTGA GAAATATTCA
CATGTTATGC ATATAGTCAG TCAAGTTGAG GGAATCTTAA AAAATAATGC TGATGTATGG
GATTTGCTCA AAGCATCCTT TCCTGCTGGG ACTGTGACTG GCGCCCCAAA AATAAGAGCT
ATGCAATTGA TTAAGCATTT TGAAAAAGAT GCTAGAGGAC CTTATGCTGG TGTATACGGA
TCTATTGATA TTAATGGTGC ATTAAATACA GCAATTACGA TAAGAACCAT GATAGTTAAA
CCCTCAAGAG ATGGGAAATA TGATGTGTCA GTGCAAGCAG GAGCTGGAAT AGTTGCTGAT
TCTTTTCCTG AAAATGAATA TCAAGAGACG ATAAATAAAG CAAAGGGAAT ACTTAAAGCA
TTAGCCTGTT TGGATAAATA A
 
Protein sequence
MISSQKISFL KAYKEGKNFI PIVETWPADL ETPLSTWLKL SSKDSHGVFL ESVEGGENLG 
RWSIVATKPL WEAVCYGEEI VKTWNNGKTE THKGDPFDIL KSWTMEYKST MLEELPSIGQ
LYGSWGYELI NRIEPSVPIN EMPENNIPQG SWMFFDQLVV FDQIKRCITA VVYADTTSSQ
ESSIEELYLN SISKIQETRN LMRVPLKENE FLDWNENDNL NLDLESNWEK KDFEDAVLSA
KEYIRKGDIF QIVISQRFQT QVNNDPFNLY RSLRMVNPSP YMSFFDFGSW YLIGSSPEVM
VKAEKNKNSQ IVASLRPIAG TRPRGIDNQQ DLELEKELLK DPKEIAEHVM LIDLGRNDLG
RVCEIGTVKV KDLMIIEKYS HVMHIVSQVE GILKNNADVW DLLKASFPAG TVTGAPKIRA
MQLIKHFEKD ARGPYAGVYG SIDINGALNT AITIRTMIVK PSRDGKYDVS VQAGAGIVAD
SFPENEYQET INKAKGILKA LACLDK
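
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates this on the first and last rows of the gene sequence, using a hand-built codon table rather than any particular library. It assumes that translation table 11 assigns the same amino acids to the 64 codons as the standard genetic code (the two tables differ in which codons may act as start codons); `translate`, `first_row`, and `last_row` are names introduced here for illustration.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these amino-acid assignments
# with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence in this record.
# The last row starts at position 1501, so it is still in frame.
first_row = "ATGATCAGCT CACAGAAAAT TAGTTTTTTA AAGGCTTATA AAGAAGGTAA AAATTTTATA"
last_row = "TTAGCCTGTT TGGATAAATA A"

print(translate(first_row))  # MISSQKISFLKAYKEGKNFI
print(translate(last_row))   # LACLDK
```

Both results match the start (MISSQKISFL KAYKEGKNFI) and end (LACLDK) of the listed protein sequence, with the final TAA codon acting as the stop.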