Gene A9601_17841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_17841 
Symbol 
ID4718518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1517744 
End bp1519264 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content34% 
IMG OID640079514 
Productanthranilate synthase component I/chorismate-binding protein 
Protein accessionYP_001010174 
Protein GI123969316 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.633527 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCAGCT CACAAAAAGA TAGTTTTTTA AAGGCTTACA AAGAAGGTAA AAACTTTATA 
CCTATAGTTG AAACTTGGCC AGCAGATTTA GAGACTCCAT TATCGACTTG GTTAAAATTA
TCTTCAAAAG ATTCCCATGG TGTTTTTCTT GAATCTGTTG AGGGTGGCGA GAATTTGGGT
AGGTGGAGTA TTGTTGCTAC TCAACCTCTT TGGGAAGCCG TTTGTTATGG AGAAGAAATA
ATTAAAACTT GGAATAATGG CAAAACTGAA ACACATAAAG GTGATCCTTT TGATATTTTG
AGAAGTTGGA CAAACGAATA CAAGTCAACC ACGCTTGATG AATTACCCTC AATTGGACAG
TTATATGGCT CTTGGGGTTA TGAATTAATA AATCGAATAG AACCAAGCGT TCCAATAAAT
GAAAAATTAG AAAACAATAT CCCTTATGGT TCCTGGATGT TTTTTGATCA GATAGTTGTT
TTTGATCAAA TAAAAAGATG TATTACTGCA GTGGTTTATG CAGATACAAC TTCTACAAAA
GAGTGCGAAA TTGAACTGTT GTACCTAAAC TCAATTTCTA GAATTAAGAA AACTAGAAAT
TTAATGAGAG TTCCTCTAAA AGAAAATGAG TTTTTAGATT GGAATGAAAA TGAGAATTTG
AATTTAGATC TAGAAAGTAA TTGGGAGAAA AAAGATTTTG AGGATGCAGT TCTCTCTGCA
AAAGAATATA TAAGAAAGGG AGATATCTTC CAAATAGTTA TTAGTCAGAG ATTCCAAACT
CAAGTCAATA ATGATCCCTT TAATTTATAT AGAAGTCTGA GAATGGTTAA TCCATCTCCA
TACATGTCAT TTTTTGATTT TGGCTCATGG TATCTGATAG GTTCAAGTCC TGAAGTCATG
GTTAAAGCAG AAAAAAATAA AAATAGTCAG ATCGTTGCAA GCTTAAGACC AATAGCTGGC
ACTAGACCTA GAGGTATTGA TAATCAGCAA GACTTGGAAT TAGAAAAGGA ATTATTAAAA
GATCCAAAAG AGATAGCTGA GCATGTAATG CTAATTGATC TTGGGAGAAA TGATCTTGGA
AGAGTTTGTG AAATTGGTAC TGTCAAGGTC AAGGATTTAA TGGTTATTGA GAAATATTCA
CATGTTATGC ATATAGTCAG TCAAGTTGAG GGAATCTTAA AAAATAATGC TGATGTATGG
GATTTGCTCA AAGCATCCTT TCCCGCTGGG ACAGTAACTG GCGCTCCAAA AATAAGAGCT
ATGCAATTGA TTAAGCACTT TGAAAAAGAT GCTAGAGGAC CTTATGCAGG TGTATACGGA
TCTATTGATA TTAATGGCGC ATTAAATACA GCAATTACAA TAAGAACTAT GATAGTAAAA
CCCTCAATAG ATGGGAAATA TGATGTTTCA GTGCAAGCAG GAGCTGGAAT AGTTGCTGAT
TCTTTTCCTG AAAATGAATA TCAAGAGACG ATAAATAAAG CAAAGGGAAT ACTAAAAGCA
CTAGCCTGTT TGGATAAATA A
 
Protein sequence
MISSQKDSFL KAYKEGKNFI PIVETWPADL ETPLSTWLKL SSKDSHGVFL ESVEGGENLG 
RWSIVATQPL WEAVCYGEEI IKTWNNGKTE THKGDPFDIL RSWTNEYKST TLDELPSIGQ
LYGSWGYELI NRIEPSVPIN EKLENNIPYG SWMFFDQIVV FDQIKRCITA VVYADTTSTK
ECEIELLYLN SISRIKKTRN LMRVPLKENE FLDWNENENL NLDLESNWEK KDFEDAVLSA
KEYIRKGDIF QIVISQRFQT QVNNDPFNLY RSLRMVNPSP YMSFFDFGSW YLIGSSPEVM
VKAEKNKNSQ IVASLRPIAG TRPRGIDNQQ DLELEKELLK DPKEIAEHVM LIDLGRNDLG
RVCEIGTVKV KDLMVIEKYS HVMHIVSQVE GILKNNADVW DLLKASFPAG TVTGAPKIRA
MQLIKHFEKD ARGPYAGVYG SIDINGALNT AITIRTMIVK PSIDGKYDVS VQAGAGIVAD
SFPENEYQET INKAKGILKA LACLDK