Gene NATL1_12111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_12111 
Symbol 
ID4779423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1060594 
End bp1062123 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content37% 
IMG OID640084490 
Producthypothetical protein 
Protein accessionYP_001015034 
Protein GI124025918 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1233] Phytoene dehydrogenase and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAATC CTGAAGTAAT AGTTATTGGA AGTGGTATAG GAGGTTTATG TTGCGGCGGA 
CTACTCGCAA AAGCAGGTAA AAAGGTCCTA ATTCTTGAGG CTCACTCAAA GCCAGGAGGT
GCTGCTCATG GCTTTGAGAA AAATGGTTAT AAGTTTGAAT CTGGTCCATC TCTATGGAGT
GGAATAGGTA GTTGGCCTAC TACAAATCCT TTAGGTCAGG TCCTTAAAGC TCTTAACCAA
AAAGTTGATT TAATTAAATA TCAGGATTGG AATGTTCAAA TTCCTGAGGG TGACTACACA
ATTGGAGTTG GAGATAGACG TTTTCTTGAT CAGATCAATT CAATTAGCGG AAAAGATGCC
ATTAAAGAGT GGGAAAATTT TATTCAAGTT ATTAAACCTA TTGGTGCAGC AGCTAATGCA
ATTCCTTTAT TAGCTCTAAA TCAAAACAAG GAAACCGTTT TTCAGCTGTT AAAACGTAGT
AAAACACTTA TCACACACTT GAAATCTTTT AAATATCTTG GAGGTGATTT TGGAAATTTA
GTTGATGACC ATCTTAAAGA TCCATTTTTA AGAAATTGGG TTGAATTACT TTGTTTTCTA
ATAAGTGGTT TATCTAAAGA CGAAACAAAT GCAGCAGCTA TGGCAACACT TTTTGATGAT
TGGTTTAAAC CCGATGCCTA CTTGGAATAT CCAAAGGGAG GAAGTGAATC AATCGTTAAG
GCACTATTGG AAGGGATTTA CTCATTTGGA GGGGATCTTC AACTAAATTC AAAAGTTAGT
CAGATAATAA TAGAAAGGAA TAAAGCAATC GGAATTGAAT TGAAAAACGG TGAGAAAATA
TTTGCAGATC ATATAGTTAG CAATGCAGAT ATTTGGAATA CCGTTGAGTT AATACCAAAA
GAGATATCCC AACAGTGGAG AGAGAAAAGG TCAAGGACTC CAAAATGTAA GTCATTTCTT
CATCTACATC TTGGGTTTAA TGCAGAAGGA CTAGATGATA TTCCACTTCA TTCAATATGG
GTTAATGATT GGTCTAAGGG TATTACAGCC GAGAGAAATG TTGTAGTTCT CTCTATTCCT
TCGGCATTAG ATCCAACAAT GTCTCCACCA AATAAGCACA TACTTCATGG GTATACACCT
GCGAATGAAC CGTGGGAAAG ATGGGAGGGG CTTAAAATTG GTTCAAAAGA ATATGAAAGT
ACAAAAGAAG AGAGGTGCTC AGTCTTCTGG GAACCAATAA AAAAATTAGT GCCTGATATA
GAAGAAAGAA TCGAAGTGAA AATGCTAGGA ACACCACTTA CACATGAACG GTTTTTAAAT
ACAAAAAATG GAAGTTATGG TCCAGCCTTA TCAGCTGCAG AAGGGCTTTT CCCAGGAAAT
AAAACTCCAA TTAAAAATCT ATTGTTGTGT GGCTCAAGTA CATTCCCAGG GATCGGGATA
CCACCTGTAG CAGCCAGTGG TGCCATGGCC GCCAATACAA TTCTTGGATC CAAATTTCAA
AGAGATCTAA TTGAAGAGCT AGGCATATAA
 
Protein sequence
MRNPEVIVIG SGIGGLCCGG LLAKAGKKVL ILEAHSKPGG AAHGFEKNGY KFESGPSLWS 
GIGSWPTTNP LGQVLKALNQ KVDLIKYQDW NVQIPEGDYT IGVGDRRFLD QINSISGKDA
IKEWENFIQV IKPIGAAANA IPLLALNQNK ETVFQLLKRS KTLITHLKSF KYLGGDFGNL
VDDHLKDPFL RNWVELLCFL ISGLSKDETN AAAMATLFDD WFKPDAYLEY PKGGSESIVK
ALLEGIYSFG GDLQLNSKVS QIIIERNKAI GIELKNGEKI FADHIVSNAD IWNTVELIPK
EISQQWREKR SRTPKCKSFL HLHLGFNAEG LDDIPLHSIW VNDWSKGITA ERNVVVLSIP
SALDPTMSPP NKHILHGYTP ANEPWERWEG LKIGSKEYES TKEERCSVFW EPIKKLVPDI
EERIEVKMLG TPLTHERFLN TKNGSYGPAL SAAEGLFPGN KTPIKNLLLC GSSTFPGIGI
PPVAASGAMA ANTILGSKFQ RDLIEELGI