Gene NATL1_20911 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_20911 
Symbol 
ID4779113 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1733021 
End bp1734010 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content39% 
IMG OID640085387 
Productputative transcriptional regulator 
Protein accessionYP_001015911 
Protein GI124026796 
COG category[K] Transcription 
COG ID[COG1725] Predicted transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.605524 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.66693 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGATTCC ACATACAACA GGACAGTGAA ATCCCTGCAT CAAACCAGTT ATATAATCAA 
GTTTGTTTTG CTATTGCCGC AAGACATTAC CCACCTGGAC ATAGGTTGCC CAGTACCAGA
CAACTTGCAA TGCAAACTGG TTTACATCGA AATACTATAA GCAAAGTCTA TAGACAGCTA
GAAACAGATG GAGTTGTTGA AGCTATTGCT GGTTCAGGTA TTTATGTTCG GGATCAGCAA
AAACAAAAAG ATTTAAGAAG TAGCTCTCCT TCACTACGTA AAAAAGGCAT CAAAGATATA
GACCATGAGA TCCGTAAAAG TATTGATGAA CTTTTAAATG CTGGATGCAC CTTGCAGCAA
ACGAGAGAAT TATTTACCCG TGAAATTGAT TGGAGACTGA GATGCGGAGC TCGCTTACTT
GTTAGTACGC CTAGAGAAGA TATTGGAGCA TCATTACTAA TTGCTGAAGA ATTGGCTCCT
CATTTAGATG TCCCAGTAGA GGTTGTTCCA ATGGAAGAAC TAGAGAGTGT TTTAGAAAGC
TCAAGCAAAG GAACAGTCGT GACAAGTAGA TATTTCTTAC AGCCTTTAGA AGAATTAGCC
AAGCGACACA AAGTAAGAGC CGTAGCTGTA GATTTAAACG ATTTTAAACA AGAACTCAAC
ATTCTAAAAA AGCTACGCAC AGGAAGTTGT GTAGGGATAG TAAGTATCAG TCCAGGAATA
TTAAGGGCAG CAGAAGTTAT TTCACACAGT ATGCGCGGCA ATGAACTCCT TTTAATGACA
GCAAATCCCG ATGTAGGAAG TCGTCTTATT GCCTTATTAA GAGCAGCTAG TCACATAATT
TGTGATAGCC CTAGCTTGCC TGTTATCGAA CATACTTTGA GACAAAATAG AACCCAATTC
ATGAGAATGC CACAAATTCA TTGTGCGCAA AAGTATCTGA GCGACTCTAC AATTGAAGAG
TTGAGTAAAG AAATTGGCCT TCTCGAATAA
 
Protein sequence
MRFHIQQDSE IPASNQLYNQ VCFAIAARHY PPGHRLPSTR QLAMQTGLHR NTISKVYRQL 
ETDGVVEAIA GSGIYVRDQQ KQKDLRSSSP SLRKKGIKDI DHEIRKSIDE LLNAGCTLQQ
TRELFTREID WRLRCGARLL VSTPREDIGA SLLIAEELAP HLDVPVEVVP MEELESVLES
SSKGTVVTSR YFLQPLEELA KRHKVRAVAV DLNDFKQELN ILKKLRTGSC VGIVSISPGI
LRAAEVISHS MRGNELLLMT ANPDVGSRLI ALLRAASHII CDSPSLPVIE HTLRQNRTQF
MRMPQIHCAQ KYLSDSTIEE LSKEIGLLE