Gene NATL1_15201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_15201 
SymbolleuA 
ID4780698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1233783 
End bp1235402 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content39% 
IMG OID640084802 
Product2-isopropylmalate synthase 
Protein accessionYP_001015342 
Protein GI124026226 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00973] 2-isopropylmalate synthase, bacterial type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.381079 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAAG ATCCGGGCCG AGTTTTAATT TTTGACACTA CATTAAGAGA TGGAGAGCAA 
TCTCCTGGAG CTAGTCTTAA TTTAGAAGAA AAGTTAGCTA TTGCTCAACA ATTAGCAAGA
TTAGGAGTTG ATGTTATTGA GGCAGGATTC CCTTTTGCTA GCCCTGGGGA TTTCGCTGCA
GTTCAGAAAA TAGCTGAGAA TGTAGGAGGA GAAGAAGGAC CTATCATTTG CGGACTATCA
AGAGCCTCAA AACCTGATAT CAAAGCTTGT GCCAATGCGA TTGCTCCAGC CCCAAAAAAA
AGGATTCATA CCTTCATTGC AACAAGTGAT ATACATCTTG AACATAAATT AAGGAAATCC
AGAAAAGAAG TACTTGATAT CGTTCCAGAT ATGGTTGGCT ATGCTAAAAG TTTTGTTGAT
GACGTTGAAT TTTCCTGTGA AGATGCAGCA AGAAGTGATT TAGATTTTCT TTATGAAGTA
ATAGAACTAG CCATATCCTC AGGGGCTAAT ACAATAAATA TTCCAGATAC AGTTGGTTAT
ATAACCCCTT CTGAATTTGG AGATTTGATA TTAAATATCA ACGAAAATGT TCCAAATATC
AATGAGGCAG TTCTGTCAGT TCATGGTCAC AACGATTTAG GACTTGCTGT CGCAAACTTC
CTTGAAGCTG TAAAGAATGG AGCTAGACAA CTTGAATGCA CCATTAACGG AATAGGTGAG
AGAGCAGGTA ATGCTGCTTT AGAAGAATTA ATCATGGCGC TTCATGTAAG AAGATCATAT
TTTAATCCAT TTTTTGGGAG GCCTCCTGAA TCCCCTACTC CTTTGACAGC AGTTAGAACA
GAGGAGATAA CTAAGTCTTC TCGCTTGGTT TCAAATTTGA CTGGGATGGT CGTACAACCG
AACAAAGCAA TTGTTGGGGC AAACGCTTTT GCGCATGAAT CTGGAATACA CCAAGATGGA
GTATTGAAAA ATAGGCTTAC ATATGAAATT ATCGATGCAA AAACAGTAGG GTTGTCTGAC
AATAAGATTT CTTTGGGAAA ATTAAGTGGT AGGAGTGCTG TTCGAGCAAG ATTAGAGGAC
CTTGGATATG ATTTAAACAG AGAAGATCTT AATGACGCTT TCGCTAGATT TAAAGATTTA
GCCGATAGAA AAAGAGAGAT AACAGATCGT GATCTAGAGG CCATTGTTAG TGAACAAGTT
CAGCTGCCAG AAGCATTGTT CCAATTAAAA TTGGTCCAAG TAAGCTGTGG CACTTCTCTA
ATGCCAACTG CAACAGTAAC TGTTGTTGGA GAAGATGGAG AGGAGAAGAC CGCCGTCTCT
CTTGGAACAG GTCCTGTTGA TGCAGTAGTA CGAGCCTTGG ATTCCCTAAC TGAAGAACCT
AATGAATTGA TTGAATTCTC AGTAAAGTCA GTTACAGAGG GGATAGATGC TCTGGGTGAA
GTTACTATTA GAATAAGAAG AGATGGAAAT CTCTTTTCTG GCCATTCTGC AGATACTGAC
GTTGTTGTTG CCGCTGCTCA AGCATACATA AATGCTCTTA ATAGATTAGT AGCTGCTCAT
GGAAGGAAAT CCATTCATCC ACAACATGAT TTGGCTAAGG TAGACAAAAA AGGGATTTGA
 
Protein sequence
MAKDPGRVLI FDTTLRDGEQ SPGASLNLEE KLAIAQQLAR LGVDVIEAGF PFASPGDFAA 
VQKIAENVGG EEGPIICGLS RASKPDIKAC ANAIAPAPKK RIHTFIATSD IHLEHKLRKS
RKEVLDIVPD MVGYAKSFVD DVEFSCEDAA RSDLDFLYEV IELAISSGAN TINIPDTVGY
ITPSEFGDLI LNINENVPNI NEAVLSVHGH NDLGLAVANF LEAVKNGARQ LECTINGIGE
RAGNAALEEL IMALHVRRSY FNPFFGRPPE SPTPLTAVRT EEITKSSRLV SNLTGMVVQP
NKAIVGANAF AHESGIHQDG VLKNRLTYEI IDAKTVGLSD NKISLGKLSG RSAVRARLED
LGYDLNREDL NDAFARFKDL ADRKREITDR DLEAIVSEQV QLPEALFQLK LVQVSCGTSL
MPTATVTVVG EDGEEKTAVS LGTGPVDAVV RALDSLTEEP NELIEFSVKS VTEGIDALGE
VTIRIRRDGN LFSGHSADTD VVVAAAQAYI NALNRLVAAH GRKSIHPQHD LAKVDKKGI