Gene NATL1_19081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_19081 
Symbol 
ID4779845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1570425 
End bp1571501 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content45% 
IMG OID640085198 
ProductWD-40 repeat-containing G-protein 
Protein accessionYP_001015728 
Protein GI124026613 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.352529 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGGTA TAGAAGCATT TAGTCCCAAG GGAATGCTTC ATGAAAGTTG GTCTGCTCAA 
GCTAACGACT ACGCAATTGT CTGCGGCTGG GCACTACAAG GTAAAACTTT TTTAGTAGGT
GATGTCGCTG GTGGGCTTTA TGCATTTGAG GGAATATCTG GAAAGCTCAT TTGGCAAATA
AAAGACATAC ATAAAGGTGG CTTACTCGCA ATGTCTATAC ATCCAAATGG AAAGACTTTT
GCAACTGCTG GCCAAGATGG ACATGTAAAT ATATGGGAAA GCCAAAAGGG TACGTCAACT
AAAACTTTGG AACTTGGGAA AGGATGGGTT GAGCACATCA AGTGGTCCCC AGACGGAAAA
TTTTTAGCTG TAGTTTTTAC TAAATACGTC TATGTTTTTG ATGATAAAGG TCAAGAACAT
TGGCGATCAG ATGAGCATCC CAGCACTGTC AGCGCGATTG CTTGGTCTAA TTCAAATGAA
TTAGCAACAG CATGCTATGG CCAAGTCACT TTTTTTGATG TAGTAAACGA CAAGATCAAT
CAAAAGTTGG AATGGCGAGG CTCGCTAGTA TCGATGGTGC TTAGTCCAGA TGGAGACATA
GTGGCATGCG GCAGCCAAGA TAATTCTGTT CATTTCTGGC GTCGTTCAAC TGATCAAGAT
TCAGAGATGA CAGGCTACCC AGGTAAACCA AGCCACCTAG CTTTTGATCA AACCGGCACA
GTCCTTGCTA CTGGGGGTAG TGATCGCGTG ACGGTTTGGA GTTTTCAAGG CGATGGTCCT
GAGGGAACTG TACCAGGAGA GTTAATGCTT CATACGGAAC CCATTTCATG TCTTGCTTTT
TCACACAGCG GGATGCTTCT TTTAGCTTCT GGCGCGAGAG ATGGTTCAGT TTTTTCTTGG
TTTCTCCAAA AAGATGGTCA GGGTGATCCA GTTGGTGGTG CATTTGCCGG TGACCTTGTA
AGCCAAATCG CTTGGCACCC TGATGACACT GCTTTGGCTG CAATAAATGC AAACGGAGGA
ATTACGGTTT GGGAGTTTAA GGTTCGGACG AAAACGTCAG CTCAAGGATT CGGATAA
 
Protein sequence
MPGIEAFSPK GMLHESWSAQ ANDYAIVCGW ALQGKTFLVG DVAGGLYAFE GISGKLIWQI 
KDIHKGGLLA MSIHPNGKTF ATAGQDGHVN IWESQKGTST KTLELGKGWV EHIKWSPDGK
FLAVVFTKYV YVFDDKGQEH WRSDEHPSTV SAIAWSNSNE LATACYGQVT FFDVVNDKIN
QKLEWRGSLV SMVLSPDGDI VACGSQDNSV HFWRRSTDQD SEMTGYPGKP SHLAFDQTGT
VLATGGSDRV TVWSFQGDGP EGTVPGELML HTEPISCLAF SHSGMLLLAS GARDGSVFSW
FLQKDGQGDP VGGAFAGDLV SQIAWHPDDT ALAAINANGG ITVWEFKVRT KTSAQGFG