Gene NATL1_21081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_21081 
Symbol 
ID4781108 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1762248 
End bp1763369 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content25% 
IMG OID640085404 
Producthypothetical protein 
Protein accessionYP_001015928 
Protein GI124026813 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2227] 2-polyprenyl-3-methyl-5-hydroxy-6-metoxy-1,4-benzoquinol methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.781663 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATACT CTGACTTTTA TTTAGCCTTC GAAAATAAAT TTCGTGGGAG CTCTGTAGAT 
GTTAATGAAA AATTAGTTTT TTATGATGGT TTGCTTGAAG AGATTAGTTC TAGATTTAGT
CATTGTAATC TTTTGGATAT AGGTTGTGGT CGTGGCGAGT GGCTTGCTAA ATGTTCAGAT
CTAGGTATAA ATTCAATTGG TATAGATAAC AATGATAGTA TGTTTAATAC CTGTAAAAGA
CAAGGCTTAA ATATTAAGTA TGGTGAAGCA TTAGATATTT TAAAAACTTT AGAGAATAAT
TCTTTTCATA TGATTAGTTC GTTTCATTTT ATAGAACATA TTTCATTTAG TATGTTTTTA
GAAATTTTAG AAGAATGCAA AAGGCTTCTT ATTCCAGGAG GTGTTTTGAT TTTTGAGACA
CCAAGTATTG ATAATATCTT AGTATCTTCA AAAGATTTTT ATTTGGATCC TACTCATGTA
TCTCATATAC ATCCAGAGAC AGTAATATTT GCATTAAATT ATTTTAAATT CACAGAGTCA
AAATATTTTT TAATTAATAA ACCCTTATAC CAAAAATATG GAGATGATAG TATTTATAAT
ATCTTAAATG GAGCGGGACT AGATGTTTCT ATAATAGCTT CTTATAATGT TAACCCTCAA
GCAGTCTCAA TATTTGATCA GAGTTTAAAT TGGATTAATA ATCTAAAGAC TTCTAAAAAT
ACTTTTGAAA AATCTAATGA ATATGATAAT TTAATCAATA ATAAAATGAT TGATCTTAGT
AGAAGAATTG ATTTTTTAAA TAATCAGTTA GATACTTTAT TTCGTATATA TGAAAAATTT
TTTAATAGCT TTCCTCTTAA AGTTTTAAGA AAGATCAATG CCGTAACTCA TCTAATCAAG
GCAATTTGTG TTAAAATCTT TAAGATTTCG ATTTCCAAAA TCTTAAAGAT ATATTTACTA
GAGAAAGCTT ATTTAAAAAT TTCTAAATTA TTTTTTAAGG ATAAATTAGA TTTGTATGCT
CATTCTAAAA ATGATAGTTA TTTAAAAAAA TTTTTTCAGT CTAATCCAAG ATCAAAGGAA
ATATTGTTAG ACATTAAATC TAAATCTAAA CCAAAGCTTT AG
 
Protein sequence
MKYSDFYLAF ENKFRGSSVD VNEKLVFYDG LLEEISSRFS HCNLLDIGCG RGEWLAKCSD 
LGINSIGIDN NDSMFNTCKR QGLNIKYGEA LDILKTLENN SFHMISSFHF IEHISFSMFL
EILEECKRLL IPGGVLIFET PSIDNILVSS KDFYLDPTHV SHIHPETVIF ALNYFKFTES
KYFLINKPLY QKYGDDSIYN ILNGAGLDVS IIASYNVNPQ AVSIFDQSLN WINNLKTSKN
TFEKSNEYDN LINNKMIDLS RRIDFLNNQL DTLFRIYEKF FNSFPLKVLR KINAVTHLIK
AICVKIFKIS ISKILKIYLL EKAYLKISKL FFKDKLDLYA HSKNDSYLKK FFQSNPRSKE
ILLDIKSKSK PKL