Gene NATL1_02381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_02381 
SymboltrpB 
ID4779575 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp220171 
End bp221424 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content42% 
IMG OID640083503 
Producttryptophan synthase subunit beta 
Protein accessionYP_001014067 
Protein GI124024951 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0133] Tryptophan synthase beta chain 
TIGRFAM ID[TIGR00263] tryptophan synthase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAGGTA CACTCCCAAC ACAATTTAAA GATTCGGATT TATCTCCTTT AACAAGGCCA 
AATGCTCTTG GTCGATTTGG TAAGTATGGG GGGCAATATG TTCCTGAAAC TTTAATACCT
GCTCTTATTG AATTAGAGCA AGCTGCTAAA GAAGCATGGA AAGATTCTTC ATTCAATTCA
GAACTAAATC ATTTACTAAA AACATACGTA GGAAGATCAA CTCCTCTTTA TGAAGCCACA
AGACTAACTG AACATTACAG AAAAAATACA TTAAAAGGTC CAAGGATTTG GCTTAAAAGA
GAAGATTTAA ATCACACAGG CGCACACAAA ATAAATAATG CACTTGGGCA AGCTCTTCTT
GCGATTCGGA TGGGAAAAAA AAGAATTATT GCTGAAACTG GGGCTGGTCA GCATGGAGTT
GCAACTGCGA CAGTCTGCGC TCGTTTTGGA TTGGAGTGCA TTATCTATAT GGGCAAAGAA
GATATGAGAA GACAAGCCTT AAACGTATTC CGAATGCAAT TGCTTGGAGC CTCAGTGAGG
CCAGTAACAA GTGGAACAGC AACACTCAAA GATGCAACCA GCGAAGCTAT TCGGGATTGG
GTTACTAATG TTGAAACGAC TCATTATATT CTCGGTTCAG TTGCAGGCCC ACATCCATAT
CCAATGTTGG TCAGAGATTT TCACGCAGTT ATTGGAGAAG AAACAAAGCA ACAATGTAAA
CAAGCTTTTG GGCGGTCTCC CGATGTTCTT CTTGCTTGTG TTGGTGGGGG ATCGAATGCG
ATGGGTCTTT TCCATTCTTT TGTTGAAGAC AAAAGTGTGA GAATGATTGG AGTTGAAGCT
GCGGGAGATG GAGTCGAAAC AGGTCGCCAT GCAGCGACAA TTACTGAAGG AAGAGTAGGA
GTTCTCCACG GCGCGATGAG TCTCTTACTA CAAGACAAAG ATGGGCAAGT TGAGGAGGCT
CATTCCATTA GCGCAGGTCT TGATTATCCA GGGGTTGGGC CGGAGCATAG TTACTTAAAA
GAAATTGGAC GTGCTGAATA TGCTGCTGTT ACTGACACTG AAGCCATAGA AGCGCTGCAA
TTAGTGAGTA AATTGGAAGG TATTATTCCT GCTCTAGAGA CTGCTCATGC ATTTGCCTAT
CTAGAAAAAC TTTGCCCAAC TCTCAATCAT AATTCTGAAA TTGTCATTAA CTGCTCTGGC
AGAGGGGATA AAGACGTGAA TACAGTTGCT GAAAAGCTAG GATCAGAAAT ATAA
 
Protein sequence
MTGTLPTQFK DSDLSPLTRP NALGRFGKYG GQYVPETLIP ALIELEQAAK EAWKDSSFNS 
ELNHLLKTYV GRSTPLYEAT RLTEHYRKNT LKGPRIWLKR EDLNHTGAHK INNALGQALL
AIRMGKKRII AETGAGQHGV ATATVCARFG LECIIYMGKE DMRRQALNVF RMQLLGASVR
PVTSGTATLK DATSEAIRDW VTNVETTHYI LGSVAGPHPY PMLVRDFHAV IGEETKQQCK
QAFGRSPDVL LACVGGGSNA MGLFHSFVED KSVRMIGVEA AGDGVETGRH AATITEGRVG
VLHGAMSLLL QDKDGQVEEA HSISAGLDYP GVGPEHSYLK EIGRAEYAAV TDTEAIEALQ
LVSKLEGIIP ALETAHAFAY LEKLCPTLNH NSEIVINCSG RGDKDVNTVA EKLGSEI