Gene Acid345_1481 details


Gene Information

Locus tag: Acid345_1481
Symbol:
ID: 4071651
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1791448
End bp: 1792743
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 60%
IMG OID: 637983490
Product: homoserine dehydrogenase
Protein accession: YP_590557
Protein GI: 94968509
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0460] Homoserine dehydrogenase
TIGRFAM ID:
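
The start, end, gene length, and protein length fields above agree with one another if the coordinates are read as 1-based and inclusive and the protein length excludes the terminal stop codon. The page does not state its coordinate convention, so that reading is an assumption, and the variable names below are illustrative only. A minimal sketch of the arithmetic:

```python
# Values copied from the Gene Information fields above.
start_bp = 1791448
end_bp = 1792743

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; the final codon is assumed to be a
# stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1296
print(protein_length)  # 431
```

Under these assumptions the computed values match the reported 1296 bp and 431 aa.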


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTCGA CTTTGAAAGT GACACGCCTC CCAGAGAACG CTTCGGCGAA AGTTTGCAAG 
GTCGCACTGG TGGGCTTTGG CACCGTCGGC AGTTCGGTGG CGCGTCTGCT GGCCTCGCGT
AATGGAGAGC TTCCCTTCCA GTTGACGCAT GTGTACAACC GCGATGTCGC ACGCAAGAAG
GTGGATTGGG TCTCGAGCGA TGTGCAGTGG ACCGATACGT TCGATGATCT GATCAAATCG
GACGCGGAAG TGATCGTCGA GCTGGTTGGT GGACTCGATC CTGCATACCA GTGGGTGAAA
AGCGCACTGC TGGCGGGTAA GGCCATCGTG ACGGCAAACA AGCAGCTGAT TGCTCACCAT
GGCAGCGAGC TGTTCGCGCT GGCGCGGGAA CGCGATCTGC ACCTTGGTTT TGGCGCCTGC
GTTGCGGGCG GAGTGCCGGT GATCGCCGCA CTGCAAGATG GTCTGGCGGG CGATCGGCTG
CACAAGGTGC GCGGGATATT GAATGGAACG TGCAATTACA TCCTGACGCG GATTGAACAG
AGCGGGGCAT CGTTCGCCGA TGCGTTGGTC GAGGCTCAGA AGGCGGGACT TGCCGAGGCT
GATCCGACCG ATGACATTGA AGGTTACGAT GCGCGCGCGA AGCTGGTCAT CCTCTCGCGC
GTAGGCTTGA ACGCCGACGT GCGTGCGGAT GAAGTGCAGT GTCGCGCGAT CTCCGGCGTG
CAGCAAATTG ACTTCGACTA CGCGCACCAG CTTGGCTGCA CGATTCGGCA GATTTCGCGC
GCCGAGAAGC ACGGCAATAA GCTGTATGCG TCCGTCGAAC CGGCTTTGGT GCCGCAAACC
GAACCTCTGG CGCGTGTGGC AGGAAGCCAG AATTTGCTGG TGAGCACCGG CGAATTTGGC
GGCGAAACCG TGTTTGCGGG CTTCGGGGCG GGCGGCAATC CGACGGCGGT CGCGGTGGTT
TCCGATCTGT TGCACATCGC ACGCCACAAG CCGCGCGAAT ATGACGCTGC CGATCCGGCG
CGTTTCGCGG TGAGCAACGA CTTCGAGACG CCGCACTATG TGCGCTTCGT CATTCAGGAC
AAACCGGGAA TCATCGCGGC GATCGCGAAC GTGCTTTCGA AGAACGGGAT TAACATTGAC
TCCGTGTTGC AGAAGCCGGG TTGTCCGAAA ACAGAGCTGC CGTTCGTGAT GACGCTGGAA
ACATGCAGCG CGGCGAAGCT CGACAATGCG CTCGCCGAAA TTGCGAAACT CGACTTCCAC
AAGCAGGCGC CGTTCTGCAT GCCGATTTTG AAATAG
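
The GC content reported in the Gene Information section can in principle be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as shown here, in space-separated blocks of 10 bases; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any tool behind this page. Only the first line of the sequence is used as sample input, so the printed value is not the whole-gene figure.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the bases."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# Sample input: the first displayed line of the gene sequence only.
first_line = "ATGGCTTCGA CTTTGAAAGT GACACGCCTC CCAGAGAACG CTTCGGCGAA AGTTTGCAAG"
print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

To check the reported figure, pass the full gene sequence (all lines) to `gc_percent` instead of the single sample line.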
 
Protein sequence
MASTLKVTRL PENASAKVCK VALVGFGTVG SSVARLLASR NGELPFQLTH VYNRDVARKK 
VDWVSSDVQW TDTFDDLIKS DAEVIVELVG GLDPAYQWVK SALLAGKAIV TANKQLIAHH
GSELFALARE RDLHLGFGAC VAGGVPVIAA LQDGLAGDRL HKVRGILNGT CNYILTRIEQ
SGASFADALV EAQKAGLAEA DPTDDIEGYD ARAKLVILSR VGLNADVRAD EVQCRAISGV
QQIDFDYAHQ LGCTIRQISR AEKHGNKLYA SVEPALVPQT EPLARVAGSQ NLLVSTGEFG
GETVFAGFGA GGNPTAVAVV SDLLHIARHK PREYDAADPA RFAVSNDFET PHYVRFVIQD
KPGIIAAIAN VLSKNGINID SVLQKPGCPK TELPFVMTLE TCSAAKLDNA LAEIAKLDFH
KQAPFCMPIL K
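
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below is a plain-Python illustration, not the method used to generate this page: it builds the standard codon-to-amino-acid mapping, which translation table 11 shares with the standard code (the two differ in the permitted start codons, which does not matter here because the gene begins with ATG). The helper name `translate` and the two sample inputs, taken from the first and last displayed lines of the gene sequence, are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(raw_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Each displayed line holds 60 bases, so every line starts in frame.
first_line = "ATGGCTTCGA CTTTGAAAGT GACACGCCTC CCAGAGAACG CTTCGGCGAA AGTTTGCAAG"
last_line = "AAGCAGGCGC CGTTCTGCAT GCCGATTTTG AAATAG"

print(translate(first_line))  # MASTLKVTRLPENASAKVCK
print(translate(last_line))   # KQAPFCMPILK
```

Both outputs match the start and end of the protein sequence listed above; the final TAG codon is the stop and yields no residue.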