Gene Acid345_1443 details

Gene Information

Locus tag: Acid345_1443
Symbol:
ID: 4071632
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1742240
End bp: 1743808
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 58%
IMG OID: 637983452
Product: succinate-semialdehyde dehydrogenase (NAD(P)+)
Protein accession: YP_590519
Protein GI: 94968471
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
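
The start, end, gene length and protein length fields in this record are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive (the usual convention for such records, but not stated on this page) and that the CDS ends with a single stop codon. The function names are hypothetical and chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length, assuming the CDS ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(1742240, 1743808)
print(length, protein_length_aa(length))
```

With the values listed in this record, the sketch reproduces the stated 1569 bp and 522 aa.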


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTACCG ATACCAAGAT CGCTGCTCCG GTATTAATCA GCACCAATCC CGCCACGGGT 
GAAACCGTCG GCACCTATTC CTGCACCAGC GTCGACGAAG TCCACGAAGC CGTTGACATC
GCACGACGCG CACAGCCAGC CTGGGCCGCG CTCGGCGTTC AGAAACGCGT CGCCATCATC
CGGCGCTTCC GAAAGCTACT GAATCAACAG GCAGGCGAAG TCGCGGAGTT AATCACGCGC
GAAGCCGGCA AACCGATTCC TGAAGCGATG GGCGCCGAGA TACTCGTCGT GCAGGATGCT
GCCGAATTCG TCGCCCGCCA CGCTGCCGAG ATTTTGCAAC CCCAGCCTGT TCCGCACTCC
AATCCCGCGA TGAAGACCAA GCGCGGAACG CTCCACCACG AACCGCACGG CGTGATTGGA
ATTATTTCCC CGTGGAACTA TCCATTTTCT ATTCCTTCCA CCGAAACGCT CGCCGCGCTC
GTTCTTGGGA ATGCCGTGGT TCTGAAGCCG TCGGAACTCA CACCGGCATG TGCTTTGAAG
TTGCAATCAT TGCTCCACGA AGCCGGCGTT CCGAAAGAAA TCATGCAGGT TGTCCTCGGT
GAAGGACCAG TCGGTGCTGC GCTCATCGAC TCCAAGATCG ACAAAATTAT CTTCACCGGA
AGTGTCGCCA CCGGCCGTCG CGTGAGCGTT GCCGCTGCGC AAAAACTTCT ACCGTGTGTC
CTCGAGCTTG GCGGTAAAGA TCCCTTCATC GTCTTCGACG ATGCCGACCT CGACGTTGCC
AGCAGCGGCG CCGTCTGGGG CGCCTTCATG AATGCCGGAC AGACTTGCTT GTCTGTCGAA
CGCTGCTACG TGCAGCGCTC GGTCTTCGAA AAATTCGTCA ACATGTGTGT GAAGAAGGCC
CAAGCGCTCA AGGTCGGCGA CGGCTTCGAT CGCGACACCG ACGTCGGACC AATGATCGAC
ACTCGCCAGT TGCGGATCGT GGAGAGTCAG GTCGCCGACG CCCTCGATAA AGGTGCGAAG
GTCCTGACGG GTGGAGAACG CCTTACGCAA CTCGGCCCAA ACTTCTACGC TCCCACTGTC
CTCACCAACG TCACTCCAGA CATGAAGCTG ATGCGTGAAG AAACATTCGG CCCGCTGCTT
CCCGTCATCC CCTTCGACAC CGACGAACAA GCCATCTCGA TGGCCAATGA ATCGGAATTC
GGTCTCGCGG CCAGTGTCTG GACGAATAGT CGCTCTCGCG GCGAAGCCGT TGCCGGGAAA
ATCGAAGCCG GCACCGTCAT GGTGAACGAT GCCATCTCCG GATTCGGAAT CTGCGAAGCG
CCGCACGGCG GATTCAAAGC CAGCGGTATC GGCCGCACTC ACGGGTTGTT AGGAATGCAG
GAAATGGTGC GGGTGCGCTA CGTCGACGTA GATCGCGTTG TGATGAAAAA ACCCTGGTGG
TACGGCTATA AAGGGATGTA TCGGGAGCAG ATCCACGGCT TCGCCGATAT GATGTTCGGC
CACTCGCCTG CCAAGCGCAT TAAGGGTGCG TTAAACTCAA CTAAAATTTT GACTAGGCCT
AAGCTGTAG
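
The gene sequence is displayed in blocks of 10 bases, so it needs light cleaning before any computation such as the GC content reported in the Gene Information section. The following is a minimal sketch, assuming the sequence contains only A, C, G and T; the helper names are hypothetical. For brevity it uses only the first displayed line of the sequence, so its result describes that 60-base fragment and not the full-gene value.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence (a fragment, not the whole gene).
first_line = "ATGGCTACCG ATACCAAGAT CGCTGCTCCG GTATTAATCA GCACCAATCC CGCCACGGGT"
fragment = clean_sequence(first_line)
print(len(fragment), round(gc_percent(fragment), 1))
```

Passing the complete sequence through the same two functions would give the whole-gene figure for comparison with the GC content field.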
 
Protein sequence
MATDTKIAAP VLISTNPATG ETVGTYSCTS VDEVHEAVDI ARRAQPAWAA LGVQKRVAII 
RRFRKLLNQQ AGEVAELITR EAGKPIPEAM GAEILVVQDA AEFVARHAAE ILQPQPVPHS
NPAMKTKRGT LHHEPHGVIG IISPWNYPFS IPSTETLAAL VLGNAVVLKP SELTPACALK
LQSLLHEAGV PKEIMQVVLG EGPVGAALID SKIDKIIFTG SVATGRRVSV AAAQKLLPCV
LELGGKDPFI VFDDADLDVA SSGAVWGAFM NAGQTCLSVE RCYVQRSVFE KFVNMCVKKA
QALKVGDGFD RDTDVGPMID TRQLRIVESQ VADALDKGAK VLTGGERLTQ LGPNFYAPTV
LTNVTPDMKL MREETFGPLL PVIPFDTDEQ AISMANESEF GLAASVWTNS RSRGEAVAGK
IEAGTVMVND AISGFGICEA PHGGFKASGI GRTHGLLGMQ EMVRVRYVDV DRVVMKKPWW
YGYKGMYREQ IHGFADMMFG HSPAKRIKGA LNSTKILTRP KL
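
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a minimal illustration that uses no external library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which codons may act as start codons; since this gene starts with ATG, no special handling of alternative starts is included. The function name `translate_cds` is hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(raw: str) -> str:
    """Translate a coding sequence, dropping one terminal stop codon if present."""
    seq = "".join(raw.split()).upper()  # strip display spaces and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First and last displayed lines of the gene sequence from this record.
print(translate_cds("ATGGCTACCG ATACCAAGAT CGCTGCTCCG GTATTAATCA GCACCAATCC CGCCACGGGT"))
print(translate_cds("AAGCTGTAG"))
```

The first call corresponds to the opening 20 residues of the protein sequence above, and the second to its final two residues followed by the stop codon.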