Gene Hoch_2048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2048 
Symbol 
ID8544430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2829562 
End bp2831526 
Gene Length1965 bp 
Protein Length654 aa 
Translation table11 
GC content68% 
IMG OID646386751 
Producttransglutaminase domain protein 
Protein accessionYP_003266486 
Protein GI262195277 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.586279 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCACC TTCTGGCGAT GTCCTTCGAT ATGCTGGCGT CTCCCTCGAT CCAGCTCCGC 
TCCTGGGCAG ACGCAAACGC AAACGCGGCC GCGACCGGCT GGGGATTTGC CTGGTATCCG
GGCGAAAACC TGGCCGCCCA GGTCATCAAA GATCCCGTCT CCACGGGCGA CACCGCGCTC
ACGCGGGTGC TGCGCGATTG GGATCGCTTC CGGGCGGTGA ACTTCGTCTG TCACATCCGG
GGCGCGGCCA AGCGGGTGAC GCAGCAGGAC ACGCCGCCCT TCGCCCGCAG CTACGCGGGC
CGCGACTGGG TGCTCGCGCA CAACGGCGAC CTCGAGCGCG GCTATCGCGA CAAGCTGTCG
CTGGGCGAGG CGCCGCAGTT CGAGCCGGTG GGGCGCACCG ACTCCGAGTG GATCCTGTGC
TGGCTGCTGG GCAAGATCGT GGCCAGCGGC GCGCGCTCGC TGGCCGGGTT TGGCTGGCCG
GCGCTGCACG CGCTGCTGCG CGAGATCAAC GCGCTGGGCA CGGCCAACCT GGTCTTCAGC
GACGGCCGCG ATCTCGTGGC CTTCCGCGAC GGCACCAGCT TCAACGAGCT GCACGTGACC
CGGCGCAAGC CGCCGCACGG GCACACCGAG CTGAGCAATC AGGCGGTCTC GGTGCGCTTC
GAGGGTCCCT TTGACCACAA CAACGCGATG GTGCTGGTGG CGACGCAGCC GCTGTCGCCG
AACTGGCGAC TGCTGGAGCC GGGCGAGATG TTGGTGAGCC GTCGCGGCGC GATCACGTGG
ACCAGTCATC CTGAGCAGCA CGCGACCATG ACAGCGCCGC CGCCGGTCGC CGCGGTGGTG
CAGCAGGGAC AGACACAAGT GCAGACGCAG GCGGAGACGC AGCAGGGACA GGGGAGAGAG
AATCAGGCGC AGCCGAACGC AGCGCCGCCG ACGCAGGATC CGCCGCGCAT CCACAGCCCG
GAGACTCTGA TCAGCAGCGC GCCGCTCGGG CTCGAGTCGC GGCTTCTGCG CGTCGTCCAC
CAGACCGTGT ACACCTACGA GCAGGCGGTG GAGCGCAGCT CGCACGTGTT CCGCCTGCAG
CCGCGGCACG ACGTGACCCA GAACCTGCTC GCGCACAGCT TGCGGATGAC GCCGACGGCG
CGCAGCACGC GCTATCACGA CGTCTTCGAT AACCACACAG TAGCCGTGGA TATCGAGTCT
CCGTATACGC AGCTAGAGCT GGTCGCGGAG TCGCTGGTGC GGGTGATGAA GCCAAACCCG
CTCGCGTCTC CGGATCGTCA TTCGACGATT CCGCTGGTGT GGATGCCGTG GCAGCGGCAG
ATGATGATGC CGTATCTGCT GCCGCCGGAG CTGCCCGAGA CCCAGCTCCG CGAGCTGTGG
GATTACGCGA TGAGCTTCGT CGAGCGCCAG GATTACAATC TGGCCGACAT TCTCGATGAT
CTCAATCAGA CAATTTATAA CGACTTCGCG TATCAGTCGG GCTCGACCAC GCTCGAGACC
ACGCCCTTCG ACGTGTACGT GAGTCGCCAC GGCGTATGCC AGGACTTCGC CAATCTGTTC
ATCTGCATCG CGCGGCTCCT GGGCGTGCCG GCGCGCTACC GCGTCGGCTA CATCTTCACC
GGCGCGGACT ACAAGAACAC GATCCAGTCG GAGGCCTCGC ACGCCTGGGT CGAGGTGTAT
CTGCCCCAGA TCGGCTGGCG CGGCTTCGAC CCGACCAACG GCTGCCAGTC GGGCATGGAC
CACGTGGGCG TGGCAGTGGG CCGCAACTTC CGCGACGCCA CGCCGACCCA GGGCACCTTG
TTCAAGGGCG GCGGCCCGGA GACCCTGAGC GCCTCGGTGC GCGTCGAGGC GGTGAGCGAC
GACGAGGCCG ACGATCTGCT GGCGCGATGG GGCAGCCCCG CCGAGCTGGC GCCGGCGCCG
GGCGCGCCCG CACAGCCGGC GCCGGCAGCG CAGCCCGCGA GCTGA
 
Protein sequence
MSHLLAMSFD MLASPSIQLR SWADANANAA ATGWGFAWYP GENLAAQVIK DPVSTGDTAL 
TRVLRDWDRF RAVNFVCHIR GAAKRVTQQD TPPFARSYAG RDWVLAHNGD LERGYRDKLS
LGEAPQFEPV GRTDSEWILC WLLGKIVASG ARSLAGFGWP ALHALLREIN ALGTANLVFS
DGRDLVAFRD GTSFNELHVT RRKPPHGHTE LSNQAVSVRF EGPFDHNNAM VLVATQPLSP
NWRLLEPGEM LVSRRGAITW TSHPEQHATM TAPPPVAAVV QQGQTQVQTQ AETQQGQGRE
NQAQPNAAPP TQDPPRIHSP ETLISSAPLG LESRLLRVVH QTVYTYEQAV ERSSHVFRLQ
PRHDVTQNLL AHSLRMTPTA RSTRYHDVFD NHTVAVDIES PYTQLELVAE SLVRVMKPNP
LASPDRHSTI PLVWMPWQRQ MMMPYLLPPE LPETQLRELW DYAMSFVERQ DYNLADILDD
LNQTIYNDFA YQSGSTTLET TPFDVYVSRH GVCQDFANLF ICIARLLGVP ARYRVGYIFT
GADYKNTIQS EASHAWVEVY LPQIGWRGFD PTNGCQSGMD HVGVAVGRNF RDATPTQGTL
FKGGGPETLS ASVRVEAVSD DEADDLLARW GSPAELAPAP GAPAQPAPAA QPAS