Gene P9303_01081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_01081 
Symbol 
ID4776122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp107923 
End bp109293 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content38% 
IMG OID640085607 
Producthypothetical protein 
Protein accessionYP_001016128 
Protein GI124021821 
COG category 
COG ID 
TIGRFAM ID[TIGR03573] N-acetyl sugar amidotransferase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.244523 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCGCT ACCCCTTCCC AAAAGAAACG GATAAAGCTC TCTACAGAGA TGCAGTAAGT 
AATGATGCGT ACTATGGATT GCCACAACCA GTTAGCTTCT GCAAATTATG TGTTATTAGT
AATCAACGTC CCTCAAGCAC GATTGAATTT AAAAATAACG GTACAAAGCC TAAAACAGTA
ATACAATTTT CAAAGGACGA AATATGTGAT GCCTGTCGAG CTAAGAAGCA AAAATCGGCA
ATTGATTGGG ATGAAAGGGC AAAGGAGCTG AAAAATCTAT GTGATCGCTT TAGAAAAACA
GATGGAGGAT ACGATTGCTT AATTCCAGGG AGTGGGGGAA AGGACAGTTT TATGCAAGCA
CATATTTTAA AGTATGAATA TGGGATGAAT CCATTAACAT GTACTTGGGC ACCAAATATT
TATACTGATT GGGGCTGGAA AAATCATCAG GCATGGATTC ATGCAGGATT TGACAATATA
CTTTTTACAC CGAATGGAAA AGTTCATCGG CTGATCACAA GATTGGCAGT AGAAAATCTC
TTTCACCCTT TTCAACCTTT TATTCTAGGG CAAAAAAATC TCGGACCAAA GATTGCAGAT
TTATATAATA TTAATCTCGT CTTTTATGGT GAAAACGAGG CTGAGTATGG GAATCCTATG
GCTGATATGT CCTCTGCACT AAGAAACTGG GAGTATTTTA CCGCTTCTAA TGAAGATGAA
ATTTATTTAG GAGGAGCATC ACTCAGTGAA CTCAGAGAAT TAGGATTGAA GGATAGTGAT
TGGGAGATAT ACCTACCAAT CGACCCTAAG ATTATCAGTA AAAAGCAAAT AGAAGTGCAC
TATCTTGGTT ATTATAAAAA ATGGCATCCC CAGGCTGCTT ATTATTACGC GATTGCGCAT
GGCAATTTTC AGAGTTCGCC AGAAAGAACA ATAGGCACAT ACAGTACATA CAATTCAATT
GATGATAAGA TCGATGATTT CCATTATCAT ACAACTTTTA TCAAATTTGG AATAGGGCGT
GCTACATATG ATGCTTCACA AGAAATACGT TCAGGGGATC TAGTAAGAGA AGAAGGGGTT
GCATTGGTGA AAAAGTTTGA TGGAGAATAT CCTGAAAGGT GGGCGGATGA AATATTTAAA
TACCTAAGTC TTCCTATGAA TGAATTCCCA ATAGCATCAA AGATGTTTGA AGAGCCAATC
TTTAACAAGA CATACTATGA GCGACTGTGT GATAAATTCA GATCACCTCA TTTATGGACA
TGGGATCGTG ACATCGGATG GAAGTTACGC CATCAGGTAT CGAACAATAA TATCGATCAA
AAAGAAACTG ATCTTCTAGC ATGGGAAGGC AATCAAGCAA AAATACAGTA A
 
Protein sequence
MTRYPFPKET DKALYRDAVS NDAYYGLPQP VSFCKLCVIS NQRPSSTIEF KNNGTKPKTV 
IQFSKDEICD ACRAKKQKSA IDWDERAKEL KNLCDRFRKT DGGYDCLIPG SGGKDSFMQA
HILKYEYGMN PLTCTWAPNI YTDWGWKNHQ AWIHAGFDNI LFTPNGKVHR LITRLAVENL
FHPFQPFILG QKNLGPKIAD LYNINLVFYG ENEAEYGNPM ADMSSALRNW EYFTASNEDE
IYLGGASLSE LRELGLKDSD WEIYLPIDPK IISKKQIEVH YLGYYKKWHP QAAYYYAIAH
GNFQSSPERT IGTYSTYNSI DDKIDDFHYH TTFIKFGIGR ATYDASQEIR SGDLVREEGV
ALVKKFDGEY PERWADEIFK YLSLPMNEFP IASKMFEEPI FNKTYYERLC DKFRSPHLWT
WDRDIGWKLR HQVSNNNIDQ KETDLLAWEG NQAKIQ