Gene NATL1_17071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_17071 
Symbol 
ID4781156 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1390933 
End bp1392987 
Gene Length2055 bp 
Protein Length684 aa 
Translation table11 
GC content35% 
IMG OID640084991 
Producthypothetical protein 
Protein accessionYP_001015527 
Protein GI124026412 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.331624 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGAATTAC CTATTGATCA TTTTCGCTTA TTGGGGGTTA GCCCTTCTGC AAATGCTGAA 
GAAGTCCTCA GGGCTTTTCA GCTAAGGCTA GATCGTCCTC CCAAGCAAGG CTTTACTTAT
GAAGTTTTAG CTCAGAGGTC TGAGCTTCTA AGGCTTTCTG CTGATCTTTT ATCTAATCCT
GCCGAACGAC AATCTTACGA ACTCGCTCTG ATTGAGGGTT CGTCTGGACT TGAATTATCT
TCAAATAGAG AAGTTGCTGG TTTACTTCTC TTGTGGGAAT CTAATGCTTC TTTCCAAGCT
TTCAAGCTTG CAAAAAAAGC ATTACAACCT CCACAAGCCC CAGCCTTGGG AAGTGGTAGA
GAGTCAGATT TAACATTAAT AGCAGCTTTA GCTTGTAGAG ATGCTTCTAT TGAGGAGCAA
GCTTGCAGAA GATACGCTTC AGGTGCTGAT TTGCTTCAGG AAGGTATACA GTTATTGCAG
AGGATGGGAA AGCTTGTTGA AGAAAGAAAA ACCCTTGAAT CCGATTTAGA GTCTTTACTT
CCATACAGAA TTCTTGATTT ATTAAGTAGA GAGAAAGAAG AAGAAAAATC TCATCAGGAA
GGCCTGATGT TGCTGGAAGA CTTTGTTAAT AAAAGAGGTG GACTCGAAGG AAAAAGAAAT
TCAGAAAAAA TAGCAGGATT AAATCAAAAT GATTTTGAGC TGTTTTTCCT CCAAATCAGA
AAATTTTTAA CTGCTAAAGA ACAGTCAAAA ATTTATGTAA ACTGGTATAG AAGAGGTTCC
GAAGATGCTG GCTTTCTTGC TGCTTTTGCT TTGATTGCTT CTGGCTATTC TTATAGGAAA
CCAGAACTCT TGCAAGAAGC TCGGAAATAT CTTCGAAATA TCAACATTAA TGGCTTTGAC
CCTATGCCAT TAATTGGATG CCTAGATCTT TTATTAGGGG ATGTAACGCA AGCAGAGTCT
CGTTTTCGAA GTAGTTCAGA TGAGAAATTA AAAGACTGGT TAGACAATTA CCCAGGCGAA
ACATTGGGAG CTTTATGCGA CTACTGTAGA AATTGGTTGA AAAAAGATGT ATTAGTAGGT
TTTAGCGATG TTGAGATACA AACTGTAAAT CTTGATGATT GGTTCGCTAG CCAAGAAGTT
CAAGTTTATG TAGAGCAGTT GGAATCAAAA GGGGCATTAG GCATTGCAAA AGCTGGATTT
TCTTTCTTAT CGTCATTGAC CCCTGAACAA CAAATTGAGA ATAATTCATC AAGAAACTTA
GAGGAAAAGG CAGATTTGCC AATGCCGGGT GGGGCTTTAG GTGAGAATTT TAATGAAAGC
TCTTTCAAAT CACGATTAGA TATTAAAGAA TTTTTCTTAA GATCTAATTT GGTTGAAAAG
ATAGTTTCAA AATATTATTC AATATTTGAA TTGATAAAAA ACTCAGATTT CAAATCTTTC
ATATTAAAAC GACCAATATA TACAAGTTCT TTAGCATTTA TTGGTTTATT TATTGTTGGA
ACAAGCTTAG GAGTCCTTAC GCAAAGAAAA CCCTCGCAAA ATAATGATCT CAGCAATATC
TCTAAGTCTG AATTGGTTAA ACCAGAAGAT ATTAAAAATA GAGATAATGG ATCAACTAAA
ATAGAAAATA ATAAAGAAAA ATTAGATTTA AAAAAATCAA TTCCTCTTAC TTCATTAGAC
CCCTCAAATC AAGAAATTAA ATCTCTTGTT GAATCATGGT TGGAAGGTAA GGCGGACATT
CTGAATGGTT CGGAAAGTCA GTTTCTTTCT TCTGTTGCTA GAGCCTCTCT ATTCAATAGA
GTTATCGAAC AGAGAAAGAA AGATAAACTT TTAGGACAAA GACAGATTAT TAATGCAGAT
ATAACTTCAA TCAATATTGT TCAAAAATCT GACAGGAGAA TTGCAGCAGA TGTTGAATTA
AACTATCAAG ATAAATTGAT TAGTTCTTCG GGTGAGATTT TATCTGAAAC GGTTATTCCT
TCTTTGAAAG TTAAATATAT AATAGGTAAG AATAAAAAAA ACTGGCTAAT AGTTGACTAT
ATTAGTGGAA ATTAA
 
Protein sequence
MELPIDHFRL LGVSPSANAE EVLRAFQLRL DRPPKQGFTY EVLAQRSELL RLSADLLSNP 
AERQSYELAL IEGSSGLELS SNREVAGLLL LWESNASFQA FKLAKKALQP PQAPALGSGR
ESDLTLIAAL ACRDASIEEQ ACRRYASGAD LLQEGIQLLQ RMGKLVEERK TLESDLESLL
PYRILDLLSR EKEEEKSHQE GLMLLEDFVN KRGGLEGKRN SEKIAGLNQN DFELFFLQIR
KFLTAKEQSK IYVNWYRRGS EDAGFLAAFA LIASGYSYRK PELLQEARKY LRNININGFD
PMPLIGCLDL LLGDVTQAES RFRSSSDEKL KDWLDNYPGE TLGALCDYCR NWLKKDVLVG
FSDVEIQTVN LDDWFASQEV QVYVEQLESK GALGIAKAGF SFLSSLTPEQ QIENNSSRNL
EEKADLPMPG GALGENFNES SFKSRLDIKE FFLRSNLVEK IVSKYYSIFE LIKNSDFKSF
ILKRPIYTSS LAFIGLFIVG TSLGVLTQRK PSQNNDLSNI SKSELVKPED IKNRDNGSTK
IENNKEKLDL KKSIPLTSLD PSNQEIKSLV ESWLEGKADI LNGSESQFLS SVARASLFNR
VIEQRKKDKL LGQRQIINAD ITSINIVQKS DRRIAADVEL NYQDKLISSS GEILSETVIP
SLKVKYIIGK NKKNWLIVDY ISGN