Gene NATL1_03581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_03581 
Symbol 
ID4781091 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp330652 
End bp332112 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content40% 
IMG OID640083626 
Productphosphotransferase superclass 
Protein accessionYP_001014187 
Protein GI124025071 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1109] Phosphomannomutase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.37944 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA AAAAGTTGAA TCTCACTAAG GAGAGAATTT CTTTCGGAAC AGATGGATGG 
AGAGGGATAT TAGGAGTTGA GTTCACCTTG GAAAGATTAC TTAAAGTTGC AGCAGCAGCA
GCTCAAGAGC TTGATTATGT GGAAGAAAAA AATAATAATA AAATAATCAT TGGTTATGAC
CGACGATTCC TAGCAGAGGA GATGGCTGAG GCGGTCGCAT CTGCAGTGAG AGGAGTTGAT
TTAGTCCCTT TGTTGGCTTC TTCCGCGCTG CCAACTCCCT CTTGTAGCTG GGGGATAGTT
GAAGAAAATG CACTTGGTGC ACTAGTGATT ACAGCAAGTC ATAACCCATG CGAATGGTTG
GGTTTGAAAA TTAAAGGTCC TTTTGGAGGC TCAGTTGATA GCTCTTTTAC CGATTCCGTG
CAAAAAAGAT TAGATGCTGG GGGGATATCA ATTCCAATCG AAGGAGTAAC CGAGAAAGTT
GATTTTCGAA AACAACATCT TTTGGGGATT AGTCAGGAAT TTGACATGCA TTTGATTGCT
GATGGCTTGA GGAAACTAGG AGTGAAAATT TTTGTCGATC CAATGCATGG ATCTGCTGCG
GGCTGCATGT CTGAATTATT TGGAGTTGAT AGTGAAGAGC TTATTTATGA AATAAGAACT
GAAAGAGACC CAAGCTTTGG TGGGAATCCT CCAGAACCTC TGAAGGCTTA CCTATCGCAA
TTAATACAAG AAGTTCAGGA TGAATCCCAG GCAGGGAAAT TGTCTATGGG CCTTGTTTTT
GATGGAGATG GAGATCGAAT TGCGGCAATA GATGAAAAAG GTAGATATTG CAATACGCAG
TTATTAATGC CTGTCTTGAT AGATCATTTG GCAAGAGTTA GAAATATGCC AGGTTGCGTT
GTAAAAACTG TGAGTGGATC GGACTTGATG AGATTAGTTG CTGAGGATCT GGGGAGAGAA
GTGCTCGAAA AGCCAGTTGG GTTTAAATAT ATTGCTGAAG AAATGCTTTC AAGAGAAGTT
CTTATTGGAG GAGAGGAGTC AGGGGGAGTT GGGTTTGGAC ATCACTTGCC AGAACGTGAT
GCTTTGTTTA CCGCTTTGCT TTTGATGGAG TCAATAGTTG CTGATGGTAA ATGTTTAGGT
GAGAAAATAG ATTCTCTTCA TGCTCGTTTT GGTAAGAGTC ATTTCGAACG TATTGATTTA
ACTCTCAAAG ACATGGAGAT GAGAAGGTCT TTGGAAGATT TTTTGAAACA GAAAACCCCA
TCTTCAATTG GTCATAAATC AGTTTTAGAG GTTATTTCAA CTGATGGAAT AAAACTCATA
CTTGATAAAA GTCATTGGCT GATGTTCCGT TTCTCTGGAA CAGAGCCTCT TTTAAGAATT
TATTGTGAAG CGCCATCCAG TGCAGAAGTT ACTTCAACTT TGTATTATGC AAAGCAACTT
ATAGATAATA GTTTTGGATA A
 
Protein sequence
MKKKKLNLTK ERISFGTDGW RGILGVEFTL ERLLKVAAAA AQELDYVEEK NNNKIIIGYD 
RRFLAEEMAE AVASAVRGVD LVPLLASSAL PTPSCSWGIV EENALGALVI TASHNPCEWL
GLKIKGPFGG SVDSSFTDSV QKRLDAGGIS IPIEGVTEKV DFRKQHLLGI SQEFDMHLIA
DGLRKLGVKI FVDPMHGSAA GCMSELFGVD SEELIYEIRT ERDPSFGGNP PEPLKAYLSQ
LIQEVQDESQ AGKLSMGLVF DGDGDRIAAI DEKGRYCNTQ LLMPVLIDHL ARVRNMPGCV
VKTVSGSDLM RLVAEDLGRE VLEKPVGFKY IAEEMLSREV LIGGEESGGV GFGHHLPERD
ALFTALLLME SIVADGKCLG EKIDSLHARF GKSHFERIDL TLKDMEMRRS LEDFLKQKTP
SSIGHKSVLE VISTDGIKLI LDKSHWLMFR FSGTEPLLRI YCEAPSSAEV TSTLYYAKQL
IDNSFG