Gene A9601_14221 details


Gene Information

Locus tag: A9601_14221
Symbol:
ID: 4718143
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1195211
End bp: 1196134
Gene Length: 924 bp
Protein Length: 307 aa
Translation table: 11
GC content: 26%
IMG OID: 640079143
Product: hypothetical protein
Protein accession: YP_001009813
Protein GI: 123968955
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
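
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1195211
END_BP = 1196134


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codon count minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(f"gene length: {length_bp} bp, protein length: {length_aa} aa")
```

Under these assumptions the computed values agree with the listed Gene Length (924 bp) and Protein Length (307 aa).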


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAAAAG AATTGTTAGT AACTGGATCA TCAGGATTTT TTGGAAGTGC ATTAATAAAT 
AGAGCCCTAA AAAGGGGATG GTTTGTTAAG GGCACAGCAA GACATTCTTT AGGAATTCTC
TCTGAACAAT TTGGTGTTGA TATCAATTAT TTAGATTTAT CAAAAGATAC AATTTCAATA
CCAAAAGCTA ATTATATAGT TCATTGCGCA ACTGCTAATG AAATTAAGTC TTTAGATTTA
TTTAAATCTA TCGATTCAAC TATAAAAGGC ACAAAAAAAT TAATTGAATA TTGTTTAGAA
AATCGATTTG AGCATTTTAT TTATATTTCG ACTGTTGGAA TTTATGGAAG AGAACTTAAT
GGAGAAATTA ATGAAAATTC TCCTTTTCAA GCAAATTCTA ATTATGCTTT AAATCATTAT
TATGCAGAAA AAATTTGTGA AAGATATGCC TCAAGAAATT TTAAAGTGAC AATAATAAGA
TTATCCAATG TTTATGGAAT TCCTTCTGTT AGCACTGTAG ATAGAAATAC ATTGGTACCT
ATATGCTTTG TAGTTAATTT ATTAAGAAAA GGTGTTGTAG AATTAAATTC TTCTGGACTT
CAGCAAAGGG ATTTTATTAA TCAAATTGAA GCATCAGATA TAGTATTAAA TTCCTTAAAT
AATCAGAAAA GTAATTTCGA TATAATTAAT GCTTCAAGCG GAAAAAGTTA TTCAATTATC
GAAATTGCAA AAATTGCATG TCAAGAATAT TCTAAATTTT CAGGAAAAGT TGGAAAAATA
ACTTCAATGC CTGATAAAAA TAATTATGAA AATAACTATA GTTTTTCTAG TAAGGCTTAT
AAAAGTAAAG ACAAAAATTT AGAATATCTT TCAATAAATG AAACTATTTC AGAGTTATTT
AAAATTTATA ATGCATTAAT TTAA
 
Protein sequence
MQKELLVTGS SGFFGSALIN RALKRGWFVK GTARHSLGIL SEQFGVDINY LDLSKDTISI 
PKANYIVHCA TANEIKSLDL FKSIDSTIKG TKKLIEYCLE NRFEHFIYIS TVGIYGRELN
GEINENSPFQ ANSNYALNHY YAEKICERYA SRNFKVTIIR LSNVYGIPSV STVDRNTLVP
ICFVVNLLRK GVVELNSSGL QQRDFINQIE ASDIVLNSLN NQKSNFDIIN ASSGKSYSII
EIAKIACQEY SKFSGKVGKI TSMPDKNNYE NNYSFSSKAY KSKDKNLEYL SINETISELF
KIYNALI
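
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup and of a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example string is only the first three blocks of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence, copied from the listing above.
example = clean_sequence("ATGCAAAAAG AATTGTTAGT AACTGGATCA")
print(len(example), round(gc_fraction(example), 2))
```

Applying the same two functions to the full gene sequence should give a length and GC fraction that can be compared against the Gene Length and GC content fields.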