Gene NATL1_00421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_00421 
Symbol 
ID4779864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp43375 
End bp44829 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content28% 
IMG OID640083305 
Producthypothetical protein 
Protein accessionYP_001013871 
Protein GI124024755 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.391795 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGGTT TTGGAGAACA GCATAAATCT AAAAAGAAAA GAAATGAGAC AACTACATTC 
CAAGTTCCAT TTCCATTTCC TTTAAGAGAA AGTCAAGAAA ATATTAATAT TATTACTAAT
ACTCCTTCGA AACTTTCTAA AGAACAAATA ATTAATCAAG CATTAAAGTT TCATTCAAAA
GGAAACATTT CAGAAGCAAC AAAATATTAT CAATATTTTA TCAACCAAGG TTTTAAAGAT
CATAGAGTTT TTTCTAATTA TGGAGCAATC CTAAGAAAAC AGGGGAAAGT AAAAGAATCA
GGGTTTTTCA TGAGAAAAGC AATTGAAATT CAACCTAATA TCCCCATAAT AAATTTTAAC
ATGGGAAATA TATTAAAAGA TCTAGGTAAA TTAAAAGAAG CGGAAATATT TATTCGTAAA
TCAATTAGAA TGGATCCTAA TCTTACAAAA GCATATCTTT CTTTATCAAC TATTACATAC
TCAAATATCG ATAATAAAAC TTGGCAGGAT AATCTTTTTT CTGATTTTAT ATTAAAGTAT
AAATCACAAG TAGAGCAAGT TGAAATCTAC TTCGCTAGAG CAAATATTCT TCATAAAGAA
AAAAAATATA AGAAAAGTTA TCAAAACTTA AAATTGGCAA ATAACTTACA ACTTGCAATT
AAGGCAAGTG ATATTAATAA AAGAATAAAA AAGTCTCAAT TATTACTTAT TGAATCTGAT
AAGAAGGTAG TTAATAAAAA AGAAAAGATA AATTATGCCC AGAGTATTTT TATTTTAGGG
TTGCCAAGAT GTGGTTCAAC ATTATTGGAA TCAATTCTTA GTATGAATAA AGACGTATAT
GATCTAGGTG AAATTAATAT TTTAGAAGAA TCATTTATAG AGTATAAGAA GTTTATACAA
GATATTAGTC TTGCTGAACT TTATAATGAT AAAGTTAAAA ACAAGACTGA CTTGAATATC
ACAACAAATA AATGGTTATA TAACTATCAA TATGCAGGTA TTATCGCTCG CCATATTCCA
AACGCAAAAC TTATTCATTG TTTTCGACAT CCTTTAGACA ATATTCTTTC ACTTTACCGA
ACACATTTTG CTAGTGGAAA TGAATATTCT TCATCCTTAG TTGATTGTGC ACGAGTTTAT
TTAGATCAGG AGAAGGTAAT GAAAGAATAT AAAAATAGAT ATAGATCAAA AATTTACGAT
TTAAATTATG ATTCATTAGT GAACGATCCT AATAAAGAAA TTAGATCGTT AATCTCTTGG
TTGGGTTGGG AATGGCAAGA TTCATTTCTA TCTCCGCAAC TAAATTCACG CTCAGTTTCA
ACAGCAAGCA ATGTTCAAGT TCGTTCCCCA ATTAATTCAA AATCTATTGG TGGATGGAAG
AACTACAAGG ACATGCTTAA ACCTGCGATT GAAATACTTG CTAAAACCAA TGGATATAAA
AACATCGCTT CCTAA
 
Protein sequence
MKGFGEQHKS KKKRNETTTF QVPFPFPLRE SQENINIITN TPSKLSKEQI INQALKFHSK 
GNISEATKYY QYFINQGFKD HRVFSNYGAI LRKQGKVKES GFFMRKAIEI QPNIPIINFN
MGNILKDLGK LKEAEIFIRK SIRMDPNLTK AYLSLSTITY SNIDNKTWQD NLFSDFILKY
KSQVEQVEIY FARANILHKE KKYKKSYQNL KLANNLQLAI KASDINKRIK KSQLLLIESD
KKVVNKKEKI NYAQSIFILG LPRCGSTLLE SILSMNKDVY DLGEINILEE SFIEYKKFIQ
DISLAELYND KVKNKTDLNI TTNKWLYNYQ YAGIIARHIP NAKLIHCFRH PLDNILSLYR
THFASGNEYS SSLVDCARVY LDQEKVMKEY KNRYRSKIYD LNYDSLVNDP NKEIRSLISW
LGWEWQDSFL SPQLNSRSVS TASNVQVRSP INSKSIGGWK NYKDMLKPAI EILAKTNGYK
NIAS