Gene P9301_01901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_01901 
Symbol 
ID4912848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp174686 
End bp175831 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content31% 
IMG OID640159756 
ProductNifS-like aminotransferase class-V 
Protein accessionYP_001090414 
Protein GI126695528 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTATCAA CTCCCATACT ACTAGACTAT CAATCTTCGA CTCCTTGCTC TAAAGATGTC 
GTTGATTCTA TGAAACCTTT TTGGAGTGAG ATATTTTCTA ACCCTGCAAA TAAATCTAAT
TTGGCTGGGA TTAACGCAAG CGCTATATTG GAAGCCTCAA GAGAAAAAAT AGAACAAAGT
TTATTTCTTA AGAATAAAAA AGTTATTTTT ACAAGTGGGG CAACTGAATC TAATAACTTA
GCCTTATTAG GTTTTGCTAG AAATTTCTAT AAAAAAACAG GAAATTATGG ACATATTATT
ACCTTAAAAA CAGAGCATAA AGCTGTTTTG GAGCCCCTTA ACCAACTTAA AAAAGAGGGA
TTTATGGTTA CAGAAATTAA TCCTGAGAAA GATGGCTTAA TTTCAGAAGA ACAATTCAAA
AAAAATATAA GAGAAGATAC ATTTCTGGTT AGTGTCATGT TGGCAAATAA CGAAATTGGA
GTTATTCAGC CTCTAGAGAA TATTTCAAAG ATATGTAAAT CGAGGGGAAT AACTTTGCAC
TCTGATTTCG CACAATGTTT AGGTTATATC GAGTTAGACA ATCTTTTATC AGACGTAAAT
ATGATAACGA TTAGTTCTCA CAAAATATAT GGTCCTAAAG GGATAGGACT TCTTTTGATT
GATGAAGAAA TTAATCTTGA GCCTTTAATT GTTGGAGGAG GTCAGGAATA TGGTCTTAGG
TCTGGCACAT TACCTCTTCC TCTAGTAGTT GGCTTTGCTA AAGCAATAGA GATAGCAGTT
CTTAATCAAA AAAATAATGC TGAGAAATTA CTTTTTTATA GAAATAATCT TTTAGAGGGG
TTGTTAAAAA ATAATTCTGG TTTAATAATT AATGGCTCAA TAGAAAAAAG ATTACCTCAT
AATTTAAATT TGACTGTATT GGATTTAAAC GGAGCAAAGT TTCATAAACT TTTAAAATCT
AAAATAATTT GTTCTACTGG ATCTGCATGT AGTAGTGGTC AACCATCTCA TGTCTTACTA
GCCTTAGGTA GATCTCTGAA AGAAGCAGAA TCTTCAATAA GGTTAAGTAT TGGATTAAGT
ACTAATTCAA AAGATATAAA ACAAGCAATT CATATTCTTA CAAATACAAT CAGATCATTA
CGATAG
 
Protein sequence
MLSTPILLDY QSSTPCSKDV VDSMKPFWSE IFSNPANKSN LAGINASAIL EASREKIEQS 
LFLKNKKVIF TSGATESNNL ALLGFARNFY KKTGNYGHII TLKTEHKAVL EPLNQLKKEG
FMVTEINPEK DGLISEEQFK KNIREDTFLV SVMLANNEIG VIQPLENISK ICKSRGITLH
SDFAQCLGYI ELDNLLSDVN MITISSHKIY GPKGIGLLLI DEEINLEPLI VGGGQEYGLR
SGTLPLPLVV GFAKAIEIAV LNQKNNAEKL LFYRNNLLEG LLKNNSGLII NGSIEKRLPH
NLNLTVLDLN GAKFHKLLKS KIICSTGSAC SSGQPSHVLL ALGRSLKEAE SSIRLSIGLS
TNSKDIKQAI HILTNTIRSL R