Gene Acid345_3430 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3430 
Symbol 
ID4070314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4050665 
End bp4051744 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content60% 
IMG OID637985452 
Productoxidoreductase 
Protein accessionYP_592505 
Protein GI94970457 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.860811 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACTC TCAACCTCGC CATGATCGGC CAGGGCTTTA TGGGCCGGGC GCACTCCAAC 
GCCTTTCACC AGGTCAACCA CTTCTTCGAT ACCGCCTTCA ACCTCAACTT GAACGTCGTC
TGCGGACGCG ATGAAAACAA GCTTCGCGAG GTGGCTTCGC GCTGGGGCTG GTCCGAGACC
GAAACCGATT GGCGCAAACT CATTGATCGG CCAGATATCG ACGTGATCGA CATTGGAGCG
CCGAACCATC TCCATGCGGA GATCGCCATC GCGGCGGCCG AGGCAGGCAA GATTGTTTGG
TGCGAGAAGC CGCTGGCTAT GGACGTCGCC GAAGCCGAGC GCATGGTGAA AGCCGTGGCG
GGAAAACCGA ATCTCGTCTG GTTCAACTAC CGTCGCGTGC CTGCGATCGC GTTCGCCCGG
CAGCTTATTG GCGACGGACG CATTGGCGAG GTTTACCACT ACCGCGCTAC CTACCTGAAC
AGCAGCGGAC TCAACGCAAC GCGCGGCAAC ACCTGGCGCT ACAAGAAAGA AACTGCCGGT
TCCGGCGCCG CCGGCGATTT GCTCTCGCAC CTGATCGATC TCGCGCTCTG GCTCAATGGG
CCGATCAGCG AAGTCTGCGG GTTGCTCAAG ACGTTTGTTC CCGGCCGCGA GGTGGACGAC
GCTGCCATCA CCATGGCGCG CTTCGCCAAC GGCAGCGTCG GGACGCTCGA AGCCACGCGC
TACGGCACCG GCAATCGCAA CCGCAACATG TTCGAGATCC ACGGCTCAAA GGGCGCGCTC
GTTTTCAACC TCGAAGATAT GAACCGCCTC CAGTTCTACG ACGTCAACGA CGCGCCGGAG
AATGGCATGC GGACGATCCT CGCTACCGGC CCAGGGCATC CGTACGTGAA CGACTTCTGG
CCTCCGGGAC ACCTCATCGG GTACGAACAC AGCTTCATCG CAACGCTTGC CGACTTCCTT
AAAGCGCTGC ATGAAGGAAC GGAGTTTCAC GCGAACTTCG AAGACGGGCT GAAAGTGCAG
AAGCTGGTGG CGGCGGTGGA GGAATCGGCG GAGAAGAAGA AATGGACGAC ACTGGCTTAA
 
Protein sequence
MKTLNLAMIG QGFMGRAHSN AFHQVNHFFD TAFNLNLNVV CGRDENKLRE VASRWGWSET 
ETDWRKLIDR PDIDVIDIGA PNHLHAEIAI AAAEAGKIVW CEKPLAMDVA EAERMVKAVA
GKPNLVWFNY RRVPAIAFAR QLIGDGRIGE VYHYRATYLN SSGLNATRGN TWRYKKETAG
SGAAGDLLSH LIDLALWLNG PISEVCGLLK TFVPGREVDD AAITMARFAN GSVGTLEATR
YGTGNRNRNM FEIHGSKGAL VFNLEDMNRL QFYDVNDAPE NGMRTILATG PGHPYVNDFW
PPGHLIGYEH SFIATLADFL KALHEGTEFH ANFEDGLKVQ KLVAAVEESA EKKKWTTLA