Gene NATL1_01761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_01761 
Symbol 
ID4781084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp167973 
End bp169154 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content32% 
IMG OID640083440 
Producthypothetical protein 
Protein accessionYP_001014005 
Protein GI124024889 
COG category[S] Function unknown 
COG ID[COG3146] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.46624 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAATA TAAAAATCAA ATGGCATGCC ACAATCCAAG AAATTCCAAA AATTATTTGG 
AATAATTTCT TAAAAGGAAA TTCAACTCCT TTTTATAAAT GGGATTGGTT GAATGCATTA
GAAAGATCAA AAAGTGTTAG TACAAAATAT GGATGGCAAC CATTATTTCT CTCTGCGTGG
AGTGAAAATA ATTTAATTGC ATGTGCACCT CTCTATCTTA AATCTCATAG TTATGGAGAA
TTTATTTTTG ATAATGCCTT TGTTCAACTA GCTCAAGATA TGGGACTTCA ATATTATCCC
AAGCTAATAG GAATGAGTCC ATTAAGTCCA ATAGAGGGAT ATCGCTTTCT GTTTGCAGAA
GGAGTTAATG ATAGAGACCT CACACAAATA TTAATCTCTG AAATTGATAG TTTTGCCAAA
CAAAATGGAA TTCTTAGTTG TAATTTTTTG TATGTAGATC CTAAATGGAT GAAAGTAGCT
GAATCTCTAA ATTGCGCTAA GTGGGTCAAC CAACAAAGTC TGTTGACATT GAATGAAGAA
AAAAGTTTTT CTGATTTTTT ACAAAAATTC AATTCCAATC AACGTAGAAA TATTAAGAGA
GAAAGAGAAA GCATAAAAAA ATGTGGAGTA AAAGTTGAAG CTCTTAGTGG GTCTCAAATA
GATGTAATGA ATTTAAAAAA AATGCATTAT TTTTATCAGC TTCATTGTTC AAGATGGGGA
GTATGGGGAA GTAAATACCT CACGGAATCA TTTTTTACTG AACTTAGATC AACAGAACTC
AAAGAAAATA TTGTTTTATT TGACGCAAAA GAAGTAGGAA TTGATAAAAC AATTGGAATG
TCCTTATGCG TGAAAAACGA AAATATGCTT TGGGGACGAT ATTGGGGTGC AGAAAAAAAT
ATAGATAATT TACATTTTGA AGCTTGTTAT TACTCCCCGA TTGAGTGGGC AATAGCAAAT
AAAATAAAAT ATTTTGACCC TGGAGCAGGA GGTAGTCACA AAAAACGCAG AGGTTTTATT
GCTAAACCCA ATGCAAGTCT TCATAGATGG TACAACTTAC CTATGGATTC ATTAATTAGA
GAATGGCTAC CAAGAGCAAA TAAGTTAATG CTTGATCAAA TAAACGCTAC AAATAATGAA
GTACCTTTTA AGTTTGAAGA GCCAAAACTA TCAAATACAT AG
 
Protein sequence
MNNIKIKWHA TIQEIPKIIW NNFLKGNSTP FYKWDWLNAL ERSKSVSTKY GWQPLFLSAW 
SENNLIACAP LYLKSHSYGE FIFDNAFVQL AQDMGLQYYP KLIGMSPLSP IEGYRFLFAE
GVNDRDLTQI LISEIDSFAK QNGILSCNFL YVDPKWMKVA ESLNCAKWVN QQSLLTLNEE
KSFSDFLQKF NSNQRRNIKR ERESIKKCGV KVEALSGSQI DVMNLKKMHY FYQLHCSRWG
VWGSKYLTES FFTELRSTEL KENIVLFDAK EVGIDKTIGM SLCVKNENML WGRYWGAEKN
IDNLHFEACY YSPIEWAIAN KIKYFDPGAG GSHKKRRGFI AKPNASLHRW YNLPMDSLIR
EWLPRANKLM LDQINATNNE VPFKFEEPKL SNT