Gene Acid345_3010 details


Gene Information

Locus tag: Acid345_3010
Symbol:
ID: 4071565
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3568817
End bp: 3570013
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 56%
IMG OID: 637985029
Product: alcohol dehydrogenase GroES-like protein
Protein accession: YP_592085
Protein GI: 94970037
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
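
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the gene ends with a single stop codon; the function names `span_length` and `coding_codons` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3568817
END_BP = 3570013
GENE_LENGTH_BP = 1197
PROTEIN_LENGTH_AA = 398


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3570013 - 3568817 + 1 gives 1197 bp, and 1197 / 3 - 1 gives 398 aa, matching the listed Gene Length and Protein Length.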


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.609837
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000185041
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAAAGCAG TTTGCTGGAA TGGACGTCAT GACATGCGGG TCGAGACGGT GGACGACCCG 
AAGATTTTGA ACCCTCGCGA TTGCATCATT AAAGTTACGC GCACCGCGAT TTGCGGCTCG
GACTTGCACC TTTACAACGG CCTTATCCCA ACGATGGAAG CGGGCGATAT TGTGGGGCAT
GAGTTCATGG GCGAAGTGGT GGAGATTGGG CCGCAGGTGA AGAAGCTGAA AGTTGGAGAC
CGTGTGGTGA TTCCCTTCAC CATCGCCTGC GGCAATTGTT TCTTTTGCCG ACAGCAGCTT
TGGTCGTCGT GCGATAACAC CAATCCGAAT GCGTACATTG CAGAAGCGCT GATGGGATAT
TCGCCGTCGG GATTGTTCGG TTATTCGCAC ATGACTGGCG GCTACGCAGG CGGCCAGGCG
CAGTACGTGC GCGTACCATT CGCAGATATC GGGCCATTGA AGATCGAAAG TGATCTGACG
GATGATCAAG TGCTGTTCTT GTCCGATGTC TTCCCTACTG GATACATGGC CGCGGAGAAC
TGCGACATCC AACCCGGCAA AGGACAAACG GTGGCGGTGT GGGGCTGCGG TCCGGTGGGA
CTGTTTGCGA TCAAGAGCGC GTTTTTGCTG GGCGCAGAAC AGGTGATCGC GATCGATCGC
TTCCCGGAGC GTCTGTACCT GGCGGAACAG GCCGGAGCAG AGACACTGAA CTACTCGGAG
ATTCCTGACC TGATCGAAGT TCTGAAGGAA CTGACTGGCG GTCGCGGACC TGATGCCTGC
ATTGATGCTG TCGGCATGGA GGCCCATGGC GTTTCGATCG ACGCCCTCGC CGATGAGGTG
AAGCAGGTGA TGAAGGTCGA GACAGATCGT CCGCTGGCAC TGCGGCAGGC GATCCAAGCG
TGCCGGAAGG GCGGAGTCGT TTCCGTTCCC GGCGTCTACG GTGGTTTCGT GGATAAGATT
CCGATGGGTG CGTTCATGAA CAAGGCGCTG ACCATGAAGA CCGGCCAGAC ACACATGATG
AAGTACATGA AGCCGCTGCT CGAACACATC GAGAAGGGCG ATATTGACCC CAGTTTCATC
ATTTCGCATC GGGTCACGAT TGATCAGGTA CCAGAGATGT ACGACGTGTG GCTTAAGAAA
CAGGACCATG TGACGAAGAT CGTGATCGAC CCGTGGGCGG AAAATATCGC GGCGTAA
 
Protein sequence
MKAVCWNGRH DMRVETVDDP KILNPRDCII KVTRTAICGS DLHLYNGLIP TMEAGDIVGH 
EFMGEVVEIG PQVKKLKVGD RVVIPFTIAC GNCFFCRQQL WSSCDNTNPN AYIAEALMGY
SPSGLFGYSH MTGGYAGGQA QYVRVPFADI GPLKIESDLT DDQVLFLSDV FPTGYMAAEN
CDIQPGKGQT VAVWGCGPVG LFAIKSAFLL GAEQVIAIDR FPERLYLAEQ AGAETLNYSE
IPDLIEVLKE LTGGRGPDAC IDAVGMEAHG VSIDALADEV KQVMKVETDR PLALRQAIQA
CRKGGVVSVP GVYGGFVDKI PMGAFMNKAL TMKTGQTHMM KYMKPLLEHI EKGDIDPSFI
ISHRVTIDQV PEMYDVWLKK QDHVTKIVID PWAENIAA
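
The GC content listed under Gene Information can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as plain text in 10-base blocks as shown; `gc_fraction` is a hypothetical helper name, and how the database rounds its percentage is not stated in this record.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters in seq.

    Spaces, line breaks, and any other characters are ignored, so the
    blocked layout used in this record can be pasted in directly.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return gc_count / len(bases)


# Illustration on the first line of the gene sequence only; pass the
# full sequence to compare against the GC content field of the record.
first_line = "ATGAAAGCAG TTTGCTGGAA TGGACGTCAT GACATGCGGG TCGAGACGGT GGACGACCCG"
print(f"{gc_fraction(first_line):.0%}")
```

Filtering to A, C, G, and T keeps the helper tolerant of the whitespace in the listing; ambiguity codes such as N, if present, would simply be skipped under this choice.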