Gene NATL1_21211 details

Gene Information

Locus tag: NATL1_21211
Symbol: dapA
ID: 4780949
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1778538
End bp: 1779485
Gene Length: 948 bp
Protein Length: 315 aa
Translation table: 11
GC content: 40%
IMG OID: 640085418
Product: dihydrodipicolinate synthase
Protein accession: YP_001015941
Protein GI: 124026826
COG category: [E] Amino acid transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase
TIGRFAM ID: [TIGR00674] dihydrodipicolinate synthase


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGTGCA GATTGCTGAA CTTCTACTGC CAAAGAATTA TGAATAAGTC AGCTTTATTA 
TCACCAGCTC CTTTTGGAAG GCTCCTAACC GCAATGGTGA CCCCATTTGA TGATGAAGGG
AAAGTTGATT ATGGTCTTGC TGCCGATTTG GCAAATTATT TGGTAGATCA AGGTTCAGAT
GGCATCGTTG TATGTGGAAC TACTGGAGAG TCACCGACTC TAAGTTGGCA AGAACAACAA
AAATTGCTGG AAATAGTAAG AAATTCCTTA GGCTCTAGGG CTAAAGTTTT AGCTGGAACA
GGCAGTAATT CGACTTCTGA GGCAATTGAA GCTACAAAGG AAGCAGCTAA TTCAGGCGCT
GATGGAGCAT TAGTTGTTGT TCCTTATTAC AACAAACCAC CGCAAGAGGG ATTAGAAGTT
CATTTTCGCG CTATTGCAAA TGCCGCTCCA AAGTTGCCTT TAATGCTCTA TAACATCCCT
GGGCGGACAG GGTGTTCAAT ATCGCCTAGT ATTGTTAGTA AGCTTATGGA TTGCAGTAAT
GTAGTCAGTT TTAAAGCTGC AAGTGGAACA ACTGAGGAAG TGACTCAATT AAGAAACTAT
TGTGGATCAG ATTTAGCTAT TTATAGCGGT GATGATGCTT TGGTTTTACC AATGCTTTCA
GTAGGGGCAG TTGGTGTTGT TAGTGTTGCA AGTCATTTAG TTGCACCTAA TTTGAAGAAA
ATTATAGAGA GTTTTTTAGA GGGTAAATAT TCTGAGGCAC TTTATTTGCA CGAGACATTA
CAACCTCTTT TTAAATCCCT TTTTGCAACT ACAAATCCAA TTCCTGTTAA AGCGGCACTT
CAACTCATCG GTTGGTCTGT TGGACCTCCT CGAAGTCCTC TAGTCTCTTT AAACAGTGAA
ATGAAAGAAG AACTCGTGAA GATACTCTCT TCTCTGAGAT TGATTTGA
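
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be joined before the Gene Length (948 bp) and GC content (40%) fields can be checked against it. The following is a minimal sketch of that check; the function names `clean_sequence` and `gc_percent` are hypothetical helpers introduced here for illustration and are not part of the source database.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Illustration on the first printed line only; paste the full gene sequence
# here to compare against the Gene Length and GC content fields.
first_line = "ATGCAGTGCA GATTGCTGAA CTTCTACTGC CAAAGAATTA TGAATAAGTC AGCTTTATTA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Counting the printed lines by hand gives 15 full lines of 60 bases plus a final line of 48 bases, which agrees with the listed length of 948 bp.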
 
Protein sequence
MQCRLLNFYC QRIMNKSALL SPAPFGRLLT AMVTPFDDEG KVDYGLAADL ANYLVDQGSD 
GIVVCGTTGE SPTLSWQEQQ KLLEIVRNSL GSRAKVLAGT GSNSTSEAIE ATKEAANSGA
DGALVVVPYY NKPPQEGLEV HFRAIANAAP KLPLMLYNIP GRTGCSISPS IVSKLMDCSN
VVSFKAASGT TEEVTQLRNY CGSDLAIYSG DDALVLPMLS VGAVGVVSVA SHLVAPNLKK
IIESFLEGKY SEALYLHETL QPLFKSLFAT TNPIPVKAAL QLIGWSVGPP RSPLVSLNSE
MKEELVKILS SLRLI
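
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. Translation table 11 (bacterial, archaeal and plant plastid code) assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Since this gene starts with ATG, the sketch below simply uses the standard codon assignments and ignores alternative start codons, which is an assumption of this example. The names `CODON_TABLE` and `translate` are hypothetical and chosen for illustration.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: alternative start codons of table 11 are not handled,
    so the first codon is translated like any other codon.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first block of the protein.
print(translate("ATGCAGTGCA GATTGCTGAA CTTCTACTGC"))  # prints MQCRLLNFYC
```

The 948 bp gene holds 316 codons; the last one (TGA) is a stop codon, which leaves the 315 amino acids listed under Protein Length.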