Gene NATL1_06561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_06561 
Symbol 
ID4780440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp601920 
End bp603275 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content38% 
IMG OID640083934 
Producthypothetical protein 
Protein accessionYP_001014483 
Protein GI124025367 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0586116 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAAAATC ATTCATTAAC AAAATCTCCT CTCTCCTTGC CTTCTTTTTC GATTCCAAAA 
ATTAGTCTTT TTATTGGGCT AACTATCACA GGTCAATGGG TTTTGAGTGA TGTGGCCCAT
ATTCCTGGGG GTGGTCTTGG ATTGCTATTA GGACTTGGTT GTATTTTTTA TTTTTTAAAA
CCAGGGAAGG TTTCATTTGA TGCTCCCTCA ACTGTTCAAG GATGGGTAAG AAGATGTCAT
GACGTTTTAG AGAATTTTGA GTACTTACTT GAGGATGGAG AGCAAAGTGA AAGAAAAAAA
GAAAGAATAA ATTCCTTGCA AAAAATTATT GATAGAAGCG AAGATCAAAG CATTGGTTTC
TTGAAAACAA AAGGCGTAAA ATTACCTGAT GAACAGCAAT TGGAAAAAGT TTTAGGAATA
AATAACCAAA TAAAAGTTTC TTTTCCACCA GCTCTTCCTG TAAGAGATCG AAATTGGATT
TTGCCAGATT TAATCCAAGA GCAAGATTTT ATTGTTTATT CTTTGACACT TCCAATGAGC
GCAGCTGATC TTTTGTGGAT TAAAAATATC CCTACTGATC AACCAGCCTG GCTAATGGTT
GCCAGTAAAG AATCTACTGA TTGGTCTGAT GAGCGAAATG CATTAGAGGC TCAATTACCA
GATAGATGGA CTAACAGAGT ATTGAAATGG GATGGATCTC AAACAGAAAT GGCAACGGTT
CTTTCTCCAA TCAAGAAACT TCTTGAAAAT CCAAAGAGGA ATACAGACAT TACTAAGCAA
AGACTTTTGT CTCGGTTGCA TACTTCTTGG CAAAAAGATT TAGAAAAATT AAGAAGAGAA
AAATTCAAGG TTATTCAAAC AAGATCTCAG TGGATAGTTG CTGGTATCGT TTTCGCCTCT
CCTGTCGCCT CAACTGATTT GCTTGCAGTT GCAGTGGTTA ATGGCTTGAT GATCAAAGAA
ATGTCGAAAA TATGGTCTTC CAAAATGAAG CCAGAATTAC TTGAGGCAGT CTCACGACAA
CTAGCAATGG CTGCAATTGC TCAAGGAGTG GTCGAATGGA GTGGACAGTC CTTGTTGAGC
TTGGCAAAGC TTGATGGCTC CTCTTGGGTT GCTGCTGGAA CAATTCAGGC CTTGAGTGCT
GCTTATTTAA CAAGAGTGGT TGGGAGATCG ATGTCTGATT GGATGGCTCT CAATAATGGA
GTAACTCAAC CTGATTTAGA ACTTATTAAG CAACAAGCTC CTCAACTAGT ATCAAAAGCT
GCTGAGCTAG AAAGAGTTGA TTGGGTGGCT TTTTTAAAGC AATCAAAAGA ATGGATTCAG
TCTCAATCTA ATAATTACAA AGTTAAATCC GTGTAA
 
Protein sequence
MENHSLTKSP LSLPSFSIPK ISLFIGLTIT GQWVLSDVAH IPGGGLGLLL GLGCIFYFLK 
PGKVSFDAPS TVQGWVRRCH DVLENFEYLL EDGEQSERKK ERINSLQKII DRSEDQSIGF
LKTKGVKLPD EQQLEKVLGI NNQIKVSFPP ALPVRDRNWI LPDLIQEQDF IVYSLTLPMS
AADLLWIKNI PTDQPAWLMV ASKESTDWSD ERNALEAQLP DRWTNRVLKW DGSQTEMATV
LSPIKKLLEN PKRNTDITKQ RLLSRLHTSW QKDLEKLRRE KFKVIQTRSQ WIVAGIVFAS
PVASTDLLAV AVVNGLMIKE MSKIWSSKMK PELLEAVSRQ LAMAAIAQGV VEWSGQSLLS
LAKLDGSSWV AAGTIQALSA AYLTRVVGRS MSDWMALNNG VTQPDLELIK QQAPQLVSKA
AELERVDWVA FLKQSKEWIQ SQSNNYKVKS V