Gene NATL1_18421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_18421 
Symbol 
ID4780607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1504193 
End bp1505185 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content39% 
IMG OID640085131 
Producthypothetical protein 
Protein accessionYP_001015662 
Protein GI124026547 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.465045 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAAG ACATTCTGCT TTTAACTGCT CTCGTTGCAG TTATTTTAAT GGGTTCTGCA 
ATGTGCTCAG GGATTGAAGC AGCATTATTA GCAGTAAACC CATTACGCGT ACATGAGCTC
GCAAGGAGAA AGCCCAAAGT ACTTGGAGCT AGAAGATTAG AAAAATTACG CCACAGAATT
GGAAGGACTT TAACTGTAGT AACAATTGCA AATAACAGTT TCAATATTTT TGGAAGTTTG
ATGGTTGGAA GCTACGCAAC TTACATATTT CAAGATCGAA TAGGAAATGT AAAATCCATA
TTTTTTGTTG GCCTAACTAT TCTTGTTTTA CTTCTTGGAG AAATTGTTCC CAAAGCTCTC
GGCACAAGAC TTGCATTGCA AATTAGTTTA ACAAGCGCTC CTGTCCTGGA TTTCTTAAGC
ATAGTTATGC GTCCATTGCT AATAGTTCTT GAACGTCTAC TACCAATCAT CACTGCCAAG
AGCGAGCTAA CAACAGACGA AGAAGAAATA AGACAGATGG CCAGACTTGG ATCTCAAATA
GGTCAAATAG AAGCTGATGA GGCTGCAATG ATATCCAAAG TTTTCCAGCT AAATGACCTT
ACTGCTAAAG ATTTGATGAC TCCACGTGTT GCCGCTCCAA CACTTCCAGG AAGAGTTTCT
TTACAATCTG TCAAATCAAA CTTATTAGAA AATAATGCAA CATGGTGGGT AGTATTAGGT
GAAGAAGTAG ACAAGGTTGT TGGAGTTGCT AACCGTGAAA AGTTATTAGC CTCTTTACTT
CAAGGAAACT CCCATTTAAC TCCTTATGAT CTAAGCGAGA ATGTAGAGTT TGTACCCGAA
ATGATTCGAG TAGATAGACT ACTTCTTGGT TTTAATGAAG ACAAAAATGG AGTTAGAGTT
GTGGTAGATG AGTTTGGTGG ATTTGTTGGT TTAATAGGAG CAGAAGCTGT ATTGGCAGTT
TTAGCTGGTT GGTGGAGGAA GTCAAATAAA TGA
 
Protein sequence
MSQDILLLTA LVAVILMGSA MCSGIEAALL AVNPLRVHEL ARRKPKVLGA RRLEKLRHRI 
GRTLTVVTIA NNSFNIFGSL MVGSYATYIF QDRIGNVKSI FFVGLTILVL LLGEIVPKAL
GTRLALQISL TSAPVLDFLS IVMRPLLIVL ERLLPIITAK SELTTDEEEI RQMARLGSQI
GQIEADEAAM ISKVFQLNDL TAKDLMTPRV AAPTLPGRVS LQSVKSNLLE NNATWWVVLG
EEVDKVVGVA NREKLLASLL QGNSHLTPYD LSENVEFVPE MIRVDRLLLG FNEDKNGVRV
VVDEFGGFVG LIGAEAVLAV LAGWWRKSNK