Gene A9601_18631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_18631 
Symbolasd 
ID4718601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1596868 
End bp1597899 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content34% 
IMG OID640079597 
Productaspartate semialdehyde dehydrogenase 
Protein accessionYP_001010253 
Protein GI123969395 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0136] Aspartate-semialdehyde dehydrogenase 
TIGRFAM ID[TIGR01296] aspartate-semialdehyde dehydrogenase (peptidoglycan organisms) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0915019 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGACAAT TTCCTTATTT GCCTAATAGG CCATTAAAAG TTGCTGTTTT AGGTTCTTCA 
GGTGCTGTGG GATCTGAATT GCTAAAAATT CTTGAACAAC GTGATTTCCC AATATCAGAA
TTGGTCTTGC TTTCATCAGA GCGGTCAGAA GGAAAAAAAA TTATTTGGAA AGGTGAAGAA
CTAGTTACAA AAAAAACAAC TAAGGAAGAA TTTAAGAATC TTGATCTAGT TTTGGCGTCA
GCTGGCGGAA GTATTTCAAA AAAGTGGTTA TCTACCATTA TTGATCAAAA TGCTTTACTG
ATAGATAATT CAAGTGCTTT CAGATTAGAT AAGAACGTTC CTCTTATAGT CCCTGAAGTT
AATGCTAGTG ACGTACTTAA TCATGATGGG GTAATAGCCA ATCCAAACTG CACTACCATT
TTGTTGACAT TAGTTTTAGC TCCATTAAAC AAACTTTCTA CTATTCAAAG AGTTATTGTC
TCAACATATC AATCTGTCAG TGGTGCAGGC CAACTGGCGA TGGAGGAACT AAAACTTTTA
ACTGAAAAAT ATCTTCAAGG AAATCCTCAA AAAAGTGAAG TTTTGCCATA CTCCCTTGCT
TTTAATTTGT TTTTACATAA TTCTCCTATG CTTTCAAATA ATTACTGCGA AGAAGAGATG
AAAATGGTTA ATGAGACAAG GAAAATATTA AATATTGCTG ATTTAAAGCT CTCTGCTACA
TGTGTTCGAG TCCCAGTTCT AAGAGCACAT TCTGAATCGA TCAACATTGA ATTTGCCGAT
GTAGTTGAGC CTAAAGAAGC TCTTGAAGAA TTAAAAAAAT CTCCTGGAAT TGAAATTATT
GAGGATTACA AAAATAATAG ATTTCCTATG CCAAATGACG TTATGGGAAG GGATAATATT
GCTGTTGGCA GGCTAAGAAC TGATATAAGT CATCCTCATG GATTAGAATT ATGGTTATGT
GGAGATCAAA TAAGAAAAGG AGCAGCTCTG AATGCTGTTC AAATAGCTGA GTTATTAATT
CCAAAAAAAT GA
 
Protein sequence
MRQFPYLPNR PLKVAVLGSS GAVGSELLKI LEQRDFPISE LVLLSSERSE GKKIIWKGEE 
LVTKKTTKEE FKNLDLVLAS AGGSISKKWL STIIDQNALL IDNSSAFRLD KNVPLIVPEV
NASDVLNHDG VIANPNCTTI LLTLVLAPLN KLSTIQRVIV STYQSVSGAG QLAMEELKLL
TEKYLQGNPQ KSEVLPYSLA FNLFLHNSPM LSNNYCEEEM KMVNETRKIL NIADLKLSAT
CVRVPVLRAH SESINIEFAD VVEPKEALEE LKKSPGIEII EDYKNNRFPM PNDVMGRDNI
AVGRLRTDIS HPHGLELWLC GDQIRKGAAL NAVQIAELLI PKK