Gene NATL1_06241 details


Gene Information

Locus tag: NATL1_06241
Symbol:
ID: 4779514
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 568193
End bp: 569257
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 37%
IMG OID: 640083901
Product: dihydroorotase
Protein accession: YP_001014451
Protein GI: 124025335
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0418] Dihydroorotase
TIGRFAM ID: [TIGR00856] dihydroorotase, homodimeric type
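
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 568193
END_BP = 569257


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count of a complete CDS: number of codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.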


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.104893
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATTGCTT CTGTTAATCA AATCTCATTA TTAAAGCCGG ATGATTGGCA TCTGCATTTG 
AGAGATGGAA AGATTCTTAA AGGTGTTTTA AGTCATACAG CAGATGTGTT TTGCCGTGCA
ATCATCATGC CTAATCTTGA TCCGCCAATA ACTACTTTAA GCCAAGCACA AGAGTATAAA
AAAAGAATTA TCCAGTCTAT CCCTGAAGGT GTTTCTTTTA CCCCATTAAT GACAGCATAT
CTTACAGATG ATATGCCTGC GAATGTTTTA GAGAGAGGCT TTAGAGAAGG TGTCTTTCAT
GGGGCAAAGC TCTATCCAGC TAATGTGACA ACTAATTCTT CTTATGGGGT TACAGATATA
AGTAAAGTCG GCAATTTATT TGAGACGATG GAAAGAATTG GTATGCCATT ATTGATTCAT
GGAGAAGTGA CCGATTTCAA TGTTGATGTA TTTGATAGAG AAGCTGTTTT TATTGAGCGT
CACCTTGAAC CACTATTACG AACATTTTCA TCACTTAAAG TGGTTTTAGA ACACATCACG
ACCATAGATG CAATTGACTT TGTAGAAAAC AGTGAGTTTG ATATAGCCGC TACAATCACA
CCTCATCATC TACATATCAA TCGAAACGCA ATGTTCAATG GTGGTTTAAG GAGTGATTTT
TATTGCTTAC CCACAGCTAA ACGTGAAATC CATCGTATTG CTCTAAGACA AGCGGCTACT
AGCGGTAAAA CTTGCTTTTT CCTTGGAACT GATTCAGCAC CTCATACCCG TAGATTTAAG
GAAAGTTCAT GTGGATGTGC AGGAATCTTT AATGCCCCTT TCGCTTTGGA AAGCTATTTA
AAAGTTTTCG AAGAAGAAAA TGCCCTAGAT AGGTTTGAAG CTTTTTCAAG TATTAATGGA
GCAACTTTTT ATGGATTACC TTTAAACACA GAGAGAATAA CTTTAATTAG AAAAGATATT
TCCGTACCTC AAATGATTGA TGTTGGATTA GATGGTAATC CCAATGATTT TGTAAAACCA
TTTCATTCAG GAGAAACTCT TAGCTGGGCA ATAAAGGATG TTTAG
 
Protein sequence
MIASVNQISL LKPDDWHLHL RDGKILKGVL SHTADVFCRA IIMPNLDPPI TTLSQAQEYK 
KRIIQSIPEG VSFTPLMTAY LTDDMPANVL ERGFREGVFH GAKLYPANVT TNSSYGVTDI
SKVGNLFETM ERIGMPLLIH GEVTDFNVDV FDREAVFIER HLEPLLRTFS SLKVVLEHIT
TIDAIDFVEN SEFDIAATIT PHHLHINRNA MFNGGLRSDF YCLPTAKREI HRIALRQAAT
SGKTCFFLGT DSAPHTRRFK ESSCGCAGIF NAPFALESYL KVFEEENALD RFEAFSSING
ATFYGLPLNT ERITLIRKDI SVPQMIDVGL DGNPNDFVKP FHSGETLSWA IKDV
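
The relationship between the gene sequence and the protein sequence can be sketched in a few lines. This is an illustrative example, not the pipeline that produced the record. It assumes the standard codon assignments (which translation table 11 shares with table 1) and treats ATG, GTG and TTG as initiator codons read as methionine, which is a subset of the start codons table 11 allows. The function names and the `initiator` parameter are hypothetical. Only the first 60 and last 45 bases of the gene are used to keep the example short.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the initiator codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(dna: str) -> str:
    """Remove the spacing used in the record and normalise to upper case."""
    return "".join(dna.split()).upper()


def translate(dna: str, initiator: bool = True) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    If `initiator` is true, a first codon in START_CODONS is read as M.
    """
    seq = clean(dna)
    protein = []
    for index in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[index:index + 3]
        residue = CODON_TABLE[codon]
        if index == 0 and initiator and codon in START_CODONS:
            residue = "M"
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 60 and last 45 bases of the gene sequence above.
FIRST_60 = "GTGATTGCTT CTGTTAATCA AATCTCATTA TTAAAGCCGG ATGATTGGCA TCTGCATTTG"
LAST_45 = "TTTCATTCAG GAGAAACTCT TAGCTGGGCA ATAAAGGATG TTTAG"

print(translate(FIRST_60))
print(translate(LAST_45, initiator=False))
```

The GTG start codon is why the protein begins with M rather than V. Passing the full gene sequence to `translate` and `gc_content` should allow the same comparison against the Protein sequence and GC content fields.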