Gene NATL1_20791 details


Gene Information

Locus tag: NATL1_20791
Symbol:
ID: 4780220
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1721838
End bp: 1722854
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 39%
IMG OID: 640085375
Product: putative carbohydrate kinase
Protein accession: YP_001015899
Protein GI: 124026784
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0524] Sugar kinases, ribokinase family
TIGRFAM ID:
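
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 1721838
END_BP = 1722854


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1017 338"
```

Both results match the Gene Length and Protein Length fields listed in the record.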


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGCTG GAAGTGTCAT TGCGATTGGA GAAGCTTTAA TAGATCGGCT TGGACCTCTT 
GGTGGAGCCC CATCTAGTGA TTTGCCAGTA ACAGATTGTT TTGGTGGCGC TCCAGCTAAT
GTTGCTTGTG CCTTGAGCAG ACTAGGAGCG AAGGTCTCTT TTATTGGCTC TTTAGGAAAT
GATGCTTTTG GAGAGGATTT TAATAATCTA TTAATCCAAA GGAGAATTAA TACCTCTGGA
TTACAGCAAG ATACTCTTCG TCCAACAAGA GTTGTATTGG TACGTAGAGA CTCTGATGGA
GAAAGATGTT TTGAAGGATT TGAGGGTGAT AAAGGCTTGG GATTTGCCGA TCAAGCCATA
TCTTTGGAAC AAATTATTCG AGACTGGCCA TTGGTTGCGG AAAATGCGCA ATGGTTAGTG
GCAGGAACAA TTCCTTTGGC TTCAGAAATA TCATCCAAAG CTTTTTTGTG GTGTATCGAA
AATGCTATGC ATTCAGGAAT AAAGATTGCC CTTGATTTGA ATTGGCGCCC AACTTTTTGG
CGAAACCAAG TTTCAAACGT CTCAGAACCC TCTGTGAAAG AAAAAAATCA AATATTGTCT
ATTTTAAAAA ACGTTTCGTT AATAAAACTC GCTAAAGAGG AGGCTCAATG GTTTTTTCAT
ACCTCTGATC CAACTGAAAT TTCTTCATCT CTTCCACAAA GACCATCTGT TGTAGTTACC
GATGGATCAA ATCCCATTTT ATGGCGACTC AATAATCACG TTGGCAAATC ATTTGCGATT
ATCCCCTCTT CTGTGGTTGA TACAACTGGA GCTGGTGATG CATTCACTGC AGGATTAATT
TATAAACTCA TCTCTGTTGA ATTAGATCAA ATCAGTGAAC AAAGTGCTAA AGATATTATT
CAATTTGGAA TTGCATGTGG CTCGCATGTT TGCAAGGGAG TAGGAGCGAT AGAACCACAA
CCTTACTTAG ATGATATTGA TAATTTATTG TCTTTATCTA AAGGAGGAAT CAGCTGA
 
Protein sequence
MRAGSVIAIG EALIDRLGPL GGAPSSDLPV TDCFGGAPAN VACALSRLGA KVSFIGSLGN 
DAFGEDFNNL LIQRRINTSG LQQDTLRPTR VVLVRRDSDG ERCFEGFEGD KGLGFADQAI
SLEQIIRDWP LVAENAQWLV AGTIPLASEI SSKAFLWCIE NAMHSGIKIA LDLNWRPTFW
RNQVSNVSEP SVKEKNQILS ILKNVSLIKL AKEEAQWFFH TSDPTEISSS LPQRPSVVVT
DGSNPILWRL NNHVGKSFAI IPSSVVDTTG AGDAFTAGLI YKLISVELDQ ISEQSAKDII
QFGIACGSHV CKGVGAIEPQ PYLDDIDNLL SLSKGGIS
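
The gene and protein sequences above can be cross-checked with a minimal sketch like the following. It assumes the sequences are pasted in with their block spacing intact, and it builds the codon table by hand rather than using a library. Translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), so the standard assignments are used here. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon assignments in T, C, A, G order; shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence listed above.
print(translate("ATGCGTGCTG GAAGTGTCAT"))  # prints "MRAGSV"
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (spaces removed) is one way to confirm the two blocks agree; `gc_fraction` on the same input can be compared with the GC content field.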