Gene Acid345_2490 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2490 
Symbol 
ID4069859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2943057 
End bp2944112 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content58% 
IMG OID637984507 
Productaspartate-semialdehyde dehydrogenase 
Protein accessionYP_591565 
Protein GI94969517 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0136] Aspartate-semialdehyde dehydrogenase 
TIGRFAM ID[TIGR00978] aspartate-semialdehyde dehydrogenase (non-peptidoglycan organisms) 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGATA AGAAAATTCC CATAGGTATT TTGGGGGCGA CGGGTATGGT CGGGCAGCGC 
TTTATTCAGC TGCTCCAAGG TCACCCCTGG TTTGAGATTG TTTGGCTCGC GGCGTCGGAA
CGTTCCGAGG GCAAGAGCTA TGCGGAAGCC GTCCGCTGGC GTATGAAGAC GCGCCTGCCG
GAGAACATCG CCAACATGCA GATGTCCCCG GCGGACCCGG CCAATGCTCC TAAAGTGATT
TTTGCGGCGC TCGATTCTGG TCCAGCGAAA GAGCTCGAAC CGAAGTTTGC GAAAGCGGGC
TGCGCGGTAG TAACGAACTC CTCGGCGTTC CGCATGTATC CGGATGTGCC ACTTGTGGTA
CCGGAAGTGA ATCCGGACCA CATCGCGATC CTGGAGCACC AGCAGTGGCG CAAGGACACC
GGCGGCTATA TCGTGACCAA TCCGAATTGC TGCGCGATTG GTTTGGTAAT GGCCCTCGCT
CCTCTGCATC AGCGATTCAC CATCGACAGG CTATTCGTCA GCACGATGCA GGCAGTCAGC
GGAGCTGGCT ATCCGGGCGT GCCTACGCTC GACATTCTCG GAAATGTGAT TCCATACATC
GGCGGAGAAG AGCCGAAGCT CGAAGCTGAG ACTAAGAAGC TTCTCGGAAC TCACACTTCG
GAAGGCATCA AAGACGCTAC CTTCGCGATC ACCGCGCACT GCAACCGCGT GCCCGTGGAA
GATGGACACA CCGAGTGCGT ATCGCTGAGT TTCGCTCGTA AGGCGAGTGA AGCCGAGATT
CTCGATGCAT GGCGTCAGTA CTCTGGCGTG CCTCAGCAGT TGGGACTTCC CTCTGCTCCA
CCCGTGGTGC TGATATATGA CGAGCGTCCG GAGCGCCCTC AGCCGCGGTT TGATGTGGAT
GCCGGCGGTG GGATGACAGC GACGATTGGC CGCTTGCGAC CGTGCGGACT GCTCGATTGG
AAATTTGTGG TGCTCTCGCA CAATACGATC CGCGGTGCAG CGGGCGCGGC GATCCTGAAC
GCGGAGTTGT TGAAGGCGAA GGGATTCCTT TCGTGA
 
Protein sequence
MQDKKIPIGI LGATGMVGQR FIQLLQGHPW FEIVWLAASE RSEGKSYAEA VRWRMKTRLP 
ENIANMQMSP ADPANAPKVI FAALDSGPAK ELEPKFAKAG CAVVTNSSAF RMYPDVPLVV
PEVNPDHIAI LEHQQWRKDT GGYIVTNPNC CAIGLVMALA PLHQRFTIDR LFVSTMQAVS
GAGYPGVPTL DILGNVIPYI GGEEPKLEAE TKKLLGTHTS EGIKDATFAI TAHCNRVPVE
DGHTECVSLS FARKASEAEI LDAWRQYSGV PQQLGLPSAP PVVLIYDERP ERPQPRFDVD
AGGGMTATIG RLRPCGLLDW KFVVLSHNTI RGAAGAAILN AELLKAKGFL S