Gene Hoch_2696 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2696 
Symbol 
ID8545083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp3715683 
End bp3717845 
Gene Length2163 bp 
Protein Length720 aa 
Translation table11 
GC content70% 
IMG OID646387390 
Productpeptidase S9 prolyl oligopeptidase active site domain protein 
Protein accessionYP_003267119 
Protein GI262195910 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.8559 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.694417 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAAACC TGCTTCAGGT AACCAAACTC TCCGCGGCCG GGCTCGCGCT CGGCTGGATC 
GCGGCGTGCG GCGGCGCGCC CCAGGCAACG CAGAGCCCGG GCCCGACGGG CGCCGAGCCG
GGCTCCGAGC AGGCCGCGGG CGGCGACGCC CAGGCCGAGA GCGAAGAGCT TTTGCCCATC
GCCGAGAGCA CGCCGCACAC GTTTTCGATC CGCGACATGC TAGCCATGGA CCGCGTCGGC
AGCCCGGCGC TGTCCCCGGA CGGCAAGACC ATCGCTTACG CGCTGCGCCA GACCGATCTG
CGCGCCAACG GCGCGCGCGT CGACATCTGG ATGGTGGCGC TCGACGACGG TATGCCGCAG
CGCTTCACCA CCCTGCCCGG CGGCGAGAGC TCGCCGACCT GGGCGCCCGA CGGCAGCGCG
CTGTACTTCC TGTCCGGACA CTCGGGCTCC TCGCAGGTGT GGAAGCAGGC CATCGCCGGC
GGCGAGGCCA CGCAGGTGAC CGAGCTGCCG CTCGACGTGA GCAGCTTCCA GATCTCGCCG
ACGGGCACCC ACCTGGCGGT GTCCATGGAG GTCTTCGTCG ACTGCGACAC GCTGAGCTGC
ACCACGGATC GCATGAACAC GCGCAGCCAG GCGGCGACCA CCGGCGTGGT CTACGAGAGC
CTGTTCGTGC GCCACTGGGA CACCTGGAAG GACGGCCGCC GCAGCCACCT GTTCACGCTG
CCCATCGACG GCGGCGAGCC CGTGGACGTG ACCGCCGGCC TCGACGCCGA CGTGCCCTCC
AAGCCCTTTG GCGGCGGCGA GGACTACACC TTCCACCCCA GCGGCGACAC CCTGGTGTTC
GCGGCCCGCG ACGCCGGCAA CGAGGAGCCG TGGTCGACCA ACTTCGATCT GTACGCGGTG
CCCGTGGACG GCAGCGAGCG GCCGCGCGCG CTCACGGCGG ACAACGCCGC CTGGGACGCG
CATCCGGTGT TCTCGCCCGA CGGCGCGCTG CTGGTGTATC TGGCCATGAA GCGCCCGGGC
TTCGAGGCCG ACCGCTTCCG CGTGATCGCG CGCGAGTGGC CCGGCGGCGA GAGCCGCGTG
CTCACCGAGA GCTGGGACCG CTCGATCAGC GACCTGGGCT TCACCCGCGA CGCCAGCAAG
CTGCTGGTGA CCACCTGGGA TCTCGGACGC AAGTCCCTGT TCTCGGTCGA CCTGCCCAGC
GGGCAGTCGC AGCGCCTGGT CGAGGGCGGC TACGTGTCGG CGCCCATGGA CACGGGCGCG
CGCATCGCGT TCCTGCGCGA TGATCTGCGC TCGCCGGCCG AGCTGTGGAC CGCGGCGCCC
GATGGCTCCG ACGCGCGCCC GCTCACCCAT CACAACGACG CCAAGCTGGC CCTGGCGCAG
ATGGGCGAGA CCGAGCAGAT CACCTTCCGC GGCGCGGGCG GCCGCAACGT CTACGCGTGG
GTGGTCAAGC CGGCCAACTT CGATCCCGAT CAGCGCTACC CGGTGGCCTT CCTCATCCAC
GGCGGGCCGC AGGGCTCGTT TGGCGATCAC TTCCATTACC GCTGGAACCC GCAGGCCTAC
GCCGGCGCCG GCTACGCCGT GGTCATGGTG GATTTCCACG GCAGCGTCGG CTACGGCCAG
GACTTCACCG ACGGCATCCG CGGCGACTGG GGTGGCAAGC CGCTGGCCGA TCTCAAGGCC
GGTCTGGCGG CCGCGCTCAA GGCCAACCCG TGGATGGACG GCGAGCGTGT GTGCGCGCTC
GGCGCCTCGT ACGGCGGCTA CATGGTCAAC TGGATCGCCG GTAACTGGAG CGACCGCTTC
CGCTGCCTGG TCAACCACGA CGGCATCTTC GACAACCGCT CGATGTACTA CGCGACCGAG
GAGCTGTGGT TCCCCGAGTG GGAGCACGGC GGGCCCTACT GGGAGAACGC CAAGGGCCAC
GAGAAGCACA ACCCGGCCCA CCACGTGGGC AAGTGGAAGA CGCCGATGCT GGTCATCCAC
GGCGCCCTCG ACCACCGCGT GCCGCTCGAG CAGGGCCTGG CCACGTATAC GGCGCTGCAG
CGCCGCGGCA TTCCCAGCAA GTTCTTGTAC TACCCGGACG AGAACCACTG GGTGCTCAAG
CCCGCCAACA GCATCCAGTG GCATGACGAG GTCATGTCCT GGCTCGACAC CTATCTGAAG
TAG
 
Protein sequence
MSNLLQVTKL SAAGLALGWI AACGGAPQAT QSPGPTGAEP GSEQAAGGDA QAESEELLPI 
AESTPHTFSI RDMLAMDRVG SPALSPDGKT IAYALRQTDL RANGARVDIW MVALDDGMPQ
RFTTLPGGES SPTWAPDGSA LYFLSGHSGS SQVWKQAIAG GEATQVTELP LDVSSFQISP
TGTHLAVSME VFVDCDTLSC TTDRMNTRSQ AATTGVVYES LFVRHWDTWK DGRRSHLFTL
PIDGGEPVDV TAGLDADVPS KPFGGGEDYT FHPSGDTLVF AARDAGNEEP WSTNFDLYAV
PVDGSERPRA LTADNAAWDA HPVFSPDGAL LVYLAMKRPG FEADRFRVIA REWPGGESRV
LTESWDRSIS DLGFTRDASK LLVTTWDLGR KSLFSVDLPS GQSQRLVEGG YVSAPMDTGA
RIAFLRDDLR SPAELWTAAP DGSDARPLTH HNDAKLALAQ MGETEQITFR GAGGRNVYAW
VVKPANFDPD QRYPVAFLIH GGPQGSFGDH FHYRWNPQAY AGAGYAVVMV DFHGSVGYGQ
DFTDGIRGDW GGKPLADLKA GLAAALKANP WMDGERVCAL GASYGGYMVN WIAGNWSDRF
RCLVNHDGIF DNRSMYYATE ELWFPEWEHG GPYWENAKGH EKHNPAHHVG KWKTPMLVIH
GALDHRVPLE QGLATYTALQ RRGIPSKFLY YPDENHWVLK PANSIQWHDE VMSWLDTYLK