Gene Hoch_6557 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_6557 
Symbol 
ID8548974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp9001041 
End bp9002561 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content70% 
IMG OID646391219 
Productpeptidase S37 tripeptidyl aminopeptidase 
Protein accessionYP_003270918 
Protein GI262199709 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.776042 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.232019 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGCCC GCGCCCCTTG GAGTTCGTCG CAGTGCGCCC CGCGGCGCGA TCGTTCAGCC 
GTCCGCCGGC CGTGCGCGCG CGTCGCCCAC GCCGCGGCGC ACGCGTGTCT GCTGGCGTGT
CTGGCCGCGT CTGTGCTCGC GTGCACGGGC CCCGAGTCCG GCGACGATGA GCGCGACATC
GCCGAGCTGC TGGCCGAGGT GCCCGGGTTC GTCGCCGTCG AAGAGCTGCC CGCGCGTCCC
GGCAGCTACG GCCGCGCGTT CGCGATCGTG CTCGAGCAGC CCGTGGATCA CGCCGCCGCT
GAGGGCGCGC GCTTCCAGCA GCACATCACC CTGGCGCACG TCGATCGCGA CGCGCCCATG
GTCCTGGCCA GCACCGGCTA CCATAATTAC CTGGGCACCA CGCCCAGCGA GCCCACGTAT
CTGCTCAACG CCAATCAGGT GGCGGTTGAG CATCGCTACT TCGGCGACTC GCGGCCCGAG
CCGACAGACT GGAGCTACCT CACGGTCGAG CAGGCGGCCG CCGATCACCA CCGCGTGGTC
GAACTCTTGC GGCCCATCTA CAGCGGCGAC TGGGTGTCCA CCGGCCTGAG CAAGGGCGGC
GTCACCTCGC TTTTGCACCG CCGCTACTAT CCCGACGACG TCACCGCGAC CGTGGCCTAC
GTCGCGCCCG TGAGCTTCTC GGTCTTCGAC GAGCGTTACC GGACGTTTTT CGACGACCTG
GACGAACAAC TCGCCGAGAT CAGCGACGGT CCGGCTTGCA AGCAGCGCGT GCGCGACCTC
CAGCGCGAGC TGCTGCTGCA CCGCGACGAG GTCGAGGACT TCTTCGCCGA GCAAGCCGCC
CTGGTAGAGT CCAGCTTCGA GCGCGTCGGC GGATTGCAGC GCGTGCTCGA GGTCGCCATC
GTCGAGATGG AGTTCAGCTA CTGGCAGTAT TTCGGCCTCG ACTCGTGCTC CGGCACGCCC
GGAGCGAGCG CGCCGCTCGA GTCGATGGCC GGTTTCCTCG CCCTCGTCAA CGATCCCCTG
GGCATGGCCG ACGGCAGCAC CGCGCTGTTC GAGCCCTACT ACTATCAGGT GCTCACCGAG
CTCGGCTATC CGCTCGTGCC CCTGGCGCAC ATCGAGGATC TGCTCATCTA CGACTACGCC
GAGAGCCTGC ACTACTTCCT GCCCGAGGAG GTCGAGATGC AGCAGCCGGC GCCGCCATTC
GACCCCGCGC CCATGCGCGA GCTCTTCGAC TGGGTGCGCG ACCAGGGCGA GCGCGTGATC
GTGGTCTACG GCGGGCAAGA CCCGTGGACC GGCGGCGCGT TCGAGCTCGC CGGCGACGAC
ACCGCGTTCG TCGTGCTGCC GAGGGAGAAC CACGGCGCGC AGCTCCTGAA CTCCAACGAC
CGCAGCGCGG TGCAGCTACT CCAGTCCTGG GTGCCGCCGT TCCAGGCCAA CACCGGGGCG
AACCGCGACG CCGGTGTGGC CCCGGCCGTG CGCAGCCTCG GCACCGAGCT ACCGCGCCCG
CGCCCGCGCC TGGGCTGGTG A
 
Protein sequence
MLARAPWSSS QCAPRRDRSA VRRPCARVAH AAAHACLLAC LAASVLACTG PESGDDERDI 
AELLAEVPGF VAVEELPARP GSYGRAFAIV LEQPVDHAAA EGARFQQHIT LAHVDRDAPM
VLASTGYHNY LGTTPSEPTY LLNANQVAVE HRYFGDSRPE PTDWSYLTVE QAAADHHRVV
ELLRPIYSGD WVSTGLSKGG VTSLLHRRYY PDDVTATVAY VAPVSFSVFD ERYRTFFDDL
DEQLAEISDG PACKQRVRDL QRELLLHRDE VEDFFAEQAA LVESSFERVG GLQRVLEVAI
VEMEFSYWQY FGLDSCSGTP GASAPLESMA GFLALVNDPL GMADGSTALF EPYYYQVLTE
LGYPLVPLAH IEDLLIYDYA ESLHYFLPEE VEMQQPAPPF DPAPMRELFD WVRDQGERVI
VVYGGQDPWT GGAFELAGDD TAFVVLPREN HGAQLLNSND RSAVQLLQSW VPPFQANTGA
NRDAGVAPAV RSLGTELPRP RPRLGW