Gene Acid345_1942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1942 
Symbol 
ID4071418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2336021 
End bp2337094 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content60% 
IMG OID637983954 
Productdeoxyhypusine synthase 
Protein accessionYP_591017 
Protein GI94968969 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1899] Deoxyhypusine synthase 
TIGRFAM ID[TIGR00321] deoxyhypusine synthase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.168438 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.810917 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCTGAGC ATAACCATCA CGATGAAGGC GCTGGTATCG AGCGCGAACT GCACGATCCG 
ATTTCCGACA AGCTGGTTGC ACTTGAGCCG CTCGACCTTT CGAAGATCCA CACTGTCGAC
GACATGGTGC GCGCGATGGC GAAGACGGCG TTTACGGGAC GCCAACTTGG TCTCGCCGCC
GATGTACTCG AGGCGATGGC GCGCGACAAA GAGGCGTTTG TCGTGATGAC GCTGGCCGGC
GCGATGACCG TGGCGAAGCA GGGATTGATC GTCGCCGACC TGGTGGATGC GGGGATCGTG
AAGACGATCG TTTCTACCGG CGCGCTGATG GCGCACGGCC TGGTGGAAGC TGCGGGCAAG
TCGCACTTCG TGTACGACGA GAAGATGAAC GACGTCGAGC TTTACGAAGC TGGGTACAAC
CGCGTGTACG ACACGCTGGA GCCGGAGACG AACTTGAACT TCGTCGAAGG TATCGTCGCG
AAGATTCTCG AGAAGTGGGA CGCCGACGAG GTGATGTGTT CGTACAAACT GAACCGCGAG
ATCGGGCGCT ACCTGCACGA GCACGTGGCA GGGCGCGGGA TCCTGAAGTC GGCGTTCGAG
AAGAATGTGC CAGTGTTCGT GCCTGCGTTT ACGGACTCAG AGATGGGATT GGACGTTGCT
CTGCACAACC GCAAGCGCAT TAAAGAAGGG CGGAAGACGA TCGACTTCAA CCCGTTCCTC
GATCTCGAAT ACTTCGCTGA GAACCTATTG AAGCAGGTGC GGCTGGGAAT TTTCACCATC
GGCGGCGGCG TGCCGCGTAA TTGGTCGCAG CAGTTCGGGC CGTTCATCGA GCTGCGGCAT
CGTCGCGGTG GGGAAGACCT GCCGTTGAAG CGCTACCACT ACGGACTCCG CATTTGTCCA
GAGCCGGTGC ACTGGGGCGG TTTGTCGGGC TCGCCATACA GCGAGGCGAT TTCGTGGGGC
AAGTTCGTGC CTCCAGCGGA GGGTGGCAAG TTCGGCGAAG TGTTCGTGGA TGCCACCGTA
GGGCTGCCGA TGATTGTGGG CGCGGTGCTG GAGAGGCTGA AAAAGAACAA GTAG
 
Protein sequence
MSEHNHHDEG AGIERELHDP ISDKLVALEP LDLSKIHTVD DMVRAMAKTA FTGRQLGLAA 
DVLEAMARDK EAFVVMTLAG AMTVAKQGLI VADLVDAGIV KTIVSTGALM AHGLVEAAGK
SHFVYDEKMN DVELYEAGYN RVYDTLEPET NLNFVEGIVA KILEKWDADE VMCSYKLNRE
IGRYLHEHVA GRGILKSAFE KNVPVFVPAF TDSEMGLDVA LHNRKRIKEG RKTIDFNPFL
DLEYFAENLL KQVRLGIFTI GGGVPRNWSQ QFGPFIELRH RRGGEDLPLK RYHYGLRICP
EPVHWGGLSG SPYSEAISWG KFVPPAEGGK FGEVFVDATV GLPMIVGAVL ERLKKNK