Gene A9601_17031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_17031 
Symbol 
ID4718434 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1446564 
End bp1447790 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content34% 
IMG OID640079430 
ProductL,L-diaminopimelate aminotransferase 
Protein accessionYP_001010093 
Protein GI123969235 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID[TIGR03542] LL-diaminopimelate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.118927 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTTCAAG TAAACGAAAA TTATTTAAAA CTCAAAGCAG GCTATTTATT CCCTGAAATT 
GCTAAAAGGG TAAAGCTATA TTCTCAATCA AATAAGAATG CTGAAATTAT CAAGCTTGGA
ATAGGAGATG TTACAGAACC ATTACCAAGA GCATGCATTG AGGCTATGGG TAAAGCTTTA
GATGATATGG GCACAACAGA TGGTTTTAGA GGTTATGGAC CAGAACAAGG TTATGCTTGG
CTCAGAGAAA AAATATCTGA GCATGATTTT ATTTCGAGGG GCTGTCAAAT TTCACCTGAA
GAAATCTTTG TTTCAGACGG ATCAAAATGC GATAGTAGCA ATATTTTAGA TATTCTTGGC
AAGGATAATT CAATTGCTGT AACAGATCCT GTTTACCCTG TTTATGTAGA TAGTAACGTG
ATGACAGGTA GAACTGGAGA TGCTCTTGAA AATGGTACTT ATCAAGGATT GACATATCTT
GCAATAAATG AAGCGAATAA CTTTTTGCCA GAACTACCTG AAAAAAAAGT TGATATTTTA
TATCTTTGTT TTCCTAATAA TCCAACTGGA GCAACGATTA ATAAAGAAGA ATTGAAAAAA
TGGGTTGACT ATGCACTTCA AAACAAATCC TTAATACTTT TTGACGCAGC TTATGAAGCA
TTTATTCAAG ATAATGATAT TCCACATTCA ATATATGAGA TTGAGGGAGC AAAGGATTGT
GCTATTGAAT TTAGATCTTT TTCAAAAAAT GCAGGATTCA CTGGAGTTAG ATGTGCTTTT
ACAGTAATAC CTAAAAATCT CAAAGGTTTG AGCTCAACAA ATGAGGAAAT AGAGTTATGG
CCTCTTTGGA ATAGGCGACA ATCTACAAAA TTCAATGGAG TAAGTTATGT TGTTCAGAAA
GGAGCAGAGG CTGTTTATTC TCTTGAAGGG AAGAAACAGG TGAGAGGTTT AATTGATTTT
TATATGGAAA ATGCAAAAAT AATGAAAAAT AAACTTCAGA ATTCTGGATA TAAAGTTTAT
GGTGGGGACA ATGCTCCTTA TATCTGGATT AAGGTTCCAG ATCAAATGAC ATCTTGGGAC
TTTTTTGATT TCCTTCTACA AAAAGTTAGT GTAGTGGGAA CACCTGGGAG CGGATTTGGA
TTGGCAGGAG AGGGTTATTT TCGCTTGTCA GCATTTAACT CACGATCAAA CGTCATTGAT
GCAATGGAAA GGATTATTAA TATATAA
 
Protein sequence
MVQVNENYLK LKAGYLFPEI AKRVKLYSQS NKNAEIIKLG IGDVTEPLPR ACIEAMGKAL 
DDMGTTDGFR GYGPEQGYAW LREKISEHDF ISRGCQISPE EIFVSDGSKC DSSNILDILG
KDNSIAVTDP VYPVYVDSNV MTGRTGDALE NGTYQGLTYL AINEANNFLP ELPEKKVDIL
YLCFPNNPTG ATINKEELKK WVDYALQNKS LILFDAAYEA FIQDNDIPHS IYEIEGAKDC
AIEFRSFSKN AGFTGVRCAF TVIPKNLKGL SSTNEEIELW PLWNRRQSTK FNGVSYVVQK
GAEAVYSLEG KKQVRGLIDF YMENAKIMKN KLQNSGYKVY GGDNAPYIWI KVPDQMTSWD
FFDFLLQKVS VVGTPGSGFG LAGEGYFRLS AFNSRSNVID AMERIINI