Gene PMN2A_1149 details


Gene Information

Locus tag: PMN2A_1149
Symbol:
ID: 3606539
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL2A
Kingdom: Bacteria
Replicon accession: NC_007335
Strand:
Start bp: 1637078
End bp: 1638598
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 38%
IMG OID: 637688022
Product: anthranilate synthase, component I
Protein accession: YP_292342
Protein GI: 72382987
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
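
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based, inclusive positions and that the final codon is a stop codon not counted in Protein Length. The record does not state either convention; they are simply the ones that reproduce the listed values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1637078, 1638598)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1521 506
```

Both results agree with the Gene Length and Protein Length fields listed for PMN2A_1149.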


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTAATT TAGATAAGGA AGAATTTATT TTGTCAGTGT CCAGCGGGGC TAATTATATT 
CCGTTGGCAA AAAGTTGGCC GGCAGATTTA GAAACTCCTC TTACAACCTG GCTTAAAGTT
GGTAATGATG CTCCTTCAGG AGTATTGCTT GAATCAGTAG AGGGTGGAGA AACTATCGGT
AGGTGGAGTG TGGTTGCATC AGATCCCCTT TGGAAAGTAG TAGTAAGGGG CGATGAATTA
ACTAGATGCT GGAGGGATGG AAAACAAGAA AAGTTTCATG GAAATCCAGT GGAAATCCTC
AGGAAAATGC TTGAGCCTTA TAAATCTGTT TCTTTGCCTG GCTTGCCACA ACTGGGACAA
CTTTTTGGCA TGTGGGGATA TGAATTAATT CAATGGATAG AGCCCTCAGT GCCTACTTAT
GAATTATCAG ATCAAGACTT ACCTGATGGT ATTTGGATGT TTATGGATAA AGTTCTGATT
TTTGATCAAG TCAAACGCCT AATAACAGCT GTTGCATATG GGAATTTAAG TGATGGAGTT
TCTTCTCAAA AAGCTTATGA AATTGCCTGT GAACAAATCA ATGAACTGCA AGATTTAATG
GCTTCTCCTT TAAAGCCAAT AAAGTCTTTA AAGTGGAATC AAAGATCGAA TAGATCTATT
GATATGGCTG CTAATACCTC AAAAAGTGAA TTTGAACATA GTGTTGAAGC GGCAAAAGAA
TTTATTAAAC AAGGCGATAT TTTTCAGTTA GTTCTTAGTC AAAAATTGGA GTCGACTGTT
ACGCAAAAAC CCTTTGAACT ATATCGAAGC CTGAGGATGG TAAATCCCTC TCCATTTATG
GCGTTTTTTG ACTTTGGTGA CTGGCAACTT ATTGGTTCTA GCCCGGAGGT AATGGTTAAG
GCCCAAAAAA CAGAAAAGGG TATTCAGACA AGTTTGAGAC CAATTGCAGG TACACGACCT
AGAGGTAAAA ATGATTTGGA AGATGCAGCC TTAGAAAAAG ATCTTTTAAA AGATCCCAAA
GAACGAGCAG AACATGTGAT GTTGGTAGAT TTGGGTCGAA ATGATTTAGG TCGAGTTTGT
ACCCCAGGTA GTGTTGTTGT GAAAGAATTA ATGGTTATTG AAAAATATTC GCATGTAATG
CATATCGTAA GTGAGGTTGA AGGCACTTTA AAAAAAGAAC AGGATGTTTG GGACTTATTA
ATTGCTTCTT TCCCAGCTGG GACTGTAAGT GGAGCCCCAA AAATAAGAGC AATGCAACTA
ATTAATCAAT TAGAAAATCA ACGTAGAGGG CCTTATTCAG GCGTTTATGG GTCTATAGAT
TTAAATGGAG CATTAAATAC AGCTATTACT ATTAGAACGA TGATTGTACG TAAAAAAAAC
AAAAATGGTT TTACTGTTGA AGTGCAAGCA GGGGCAGGGG TTGTTGCAGA TTCCATTCCT
TCTAATGAGT ATCAAGAAAC TTTAAATAAA GCTAAAGGGA TGTTTACTGC TTTAGCTTGC
TTAGACCCCC AAGATTTATG A
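
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction of the kind reported in the GC content field. It is illustrative only: the helper names are hypothetical, and the example runs on just the first two blocks of the sequence, not the whole gene, so its result is not expected to match the record's 38%.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence, used as a small example.
first_blocks = "ATGCTTAATT TAGATAAGGA"
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))  # 20 0.25
```

Passing the full listing through clean_sequence and gc_fraction in the same way gives a whole-gene value to compare against the GC content field.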
 
Protein sequence
MLNLDKEEFI LSVSSGANYI PLAKSWPADL ETPLTTWLKV GNDAPSGVLL ESVEGGETIG 
RWSVVASDPL WKVVVRGDEL TRCWRDGKQE KFHGNPVEIL RKMLEPYKSV SLPGLPQLGQ
LFGMWGYELI QWIEPSVPTY ELSDQDLPDG IWMFMDKVLI FDQVKRLITA VAYGNLSDGV
SSQKAYEIAC EQINELQDLM ASPLKPIKSL KWNQRSNRSI DMAANTSKSE FEHSVEAAKE
FIKQGDIFQL VLSQKLESTV TQKPFELYRS LRMVNPSPFM AFFDFGDWQL IGSSPEVMVK
AQKTEKGIQT SLRPIAGTRP RGKNDLEDAA LEKDLLKDPK ERAEHVMLVD LGRNDLGRVC
TPGSVVVKEL MVIEKYSHVM HIVSEVEGTL KKEQDVWDLL IASFPAGTVS GAPKIRAMQL
INQLENQRRG PYSGVYGSID LNGALNTAIT IRTMIVRKKN KNGFTVEVQA GAGVVADSIP
SNEYQETLNK AKGMFTALAC LDPQDL
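
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal pure-Python translation, not the method used by the database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which codons may act as start codons). The names BASES, AMINO_ACIDS, CODON_TABLE and translate are illustrative.

```python
# Codon table built in NCBI order (first, second, third base each run over T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks and the final line of the gene sequence listing.
print(translate("ATGCTTAATT TAGATAAGGA AGAATTTATT"))  # MLNLDKEEFI
print(translate("TTAGACCCCC AAGATTTATG A"))           # LDPQDL
```

The two outputs match the first ten and last six residues of the listed protein sequence. Each full line of the gene listing holds 60 bases (20 codons), so every line starts in frame and can be translated on its own in this way.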