Gene NATL1_10751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_10751 
Symbol 
ID4781013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp989843 
End bp990880 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content34% 
IMG OID640084354 
Producthypothetical protein 
Protein accessionYP_001014898 
Protein GI124025782 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0702] Predicted nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0490933 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0263054 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCATAAGC AAGTAGAAAT TTTGGAGTCA GATGCAAAGG AAAAGCTTGT TGTAGCAGTG 
ACTATGGCCA CAGGCAGGCA AGGTATTGGT GTCGTTAAAG AATTAAGCCA AACAGATAAA
TACCAAATAC GTGCTATCAC AAGAAATATA AAGAGTACAA AGGCTCTAGA GCTAGGAAGC
CTAAACAACG TTGAACTAGT CAAGGGAGAT TTAATGGATC CTGAAAGCCT TAAAAAAGCT
TTTGAAGGAG TTGATGTGAT TTTTGGAAAT ACAACACCTA CAAAAGGATG GAAATTATTT
AGAGGAAGTA TCGTCAGATC TTATGAAATG GAACAAGGTT ATAACTTAAT AAATCAAGTC
AAAACTGCCT ACGAAAAAGG ATGTCTAAAT CACTTTATAT TTAGCTCAAT TAGTAAAGCA
AAAGACCCAC TAAAAAATGA TCCTGCTCCA GGACATTTTA CGAGTAAATG GGATATTGAA
GAATATATAG AAAAATCAGG TCTTAAAAAA ATTACTACTG TATTAAGACC CGTTAGCTAC
TTTGAAAACT TTGAAAACAA ATTACCTGGC TATACAATTT CAAAGAAAAT TTTTCCAGGA
ATAGTTGGCA AGAATTTTAA GTGGCAAACA ATCGCAGTAG AAGATGTAGG TAAATGGGTT
AGAGGTGTTT TATCAAAATC AGAGAAATAT AAAAATCAAT CTATCAATAT TGCCGGCGAG
GAACTAACAG GACTGGAAAT GGCTATGACA CTTCAAAGAA TAGTTTCTTC AGAAGGACTA
AAAACAAATT ATGTGATGAT CCCTAGATTA GCAATTAAGT TATTGGAATA CGACATTGGC
GTTATGGCAG ATTGGATTGA AAGATCAGGC TATGGAGCTG ATATGAATAA TCTTCAATCG
ATTCAGGAAG AGTTAAATAT TGCTCCTACA TCACTTAAAG ACTGGCTAAA GACAAAACTT
AAAAAACAAA CTAAGAAACA AAATTCATGG GCAAGGCAGT GGAAATCATC TCAGTGGAAA
CTTCAATGGG ATAAATAA
 
Protein sequence
MHKQVEILES DAKEKLVVAV TMATGRQGIG VVKELSQTDK YQIRAITRNI KSTKALELGS 
LNNVELVKGD LMDPESLKKA FEGVDVIFGN TTPTKGWKLF RGSIVRSYEM EQGYNLINQV
KTAYEKGCLN HFIFSSISKA KDPLKNDPAP GHFTSKWDIE EYIEKSGLKK ITTVLRPVSY
FENFENKLPG YTISKKIFPG IVGKNFKWQT IAVEDVGKWV RGVLSKSEKY KNQSINIAGE
ELTGLEMAMT LQRIVSSEGL KTNYVMIPRL AIKLLEYDIG VMADWIERSG YGADMNNLQS
IQEELNIAPT SLKDWLKTKL KKQTKKQNSW ARQWKSSQWK LQWDK