Gene Hoch_5674 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5674 
Symbol 
ID8548088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7787422 
End bp7790163 
Gene Length2742 bp 
Protein Length913 aa 
Translation table11 
GC content73% 
IMG OID646390342 
ProductPeptidase M1 membrane alanine aminopeptidase 
Protein accessionYP_003270044 
Protein GI262198835 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0620443 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGTTC AACGATCTCT TGTCTTCTCC TCCTGCCTGA GCCTGGCGCT GCTGAGCGCG 
GCCTGCGGTG GCCGCCAAGC GCCCGCGGGC GTCGCGGCCG AGTCGCCGCG CGCGGCCGTG
GCCGCCGAGG GCATGCCCCT GGGGCCCGAG GCCACCGCGC CCGCGCTGCG TCTCGAGGGC
GACGCCCGGC CGCTCGGCTA CGAACTCTGG CTCGATCTCG ACCCGGCGTC CGAAATCTTC
ACCGGCGAGG TGGTGATCGC GCTGGCCCTC GACCAGCCCA GCGAGCGCGT ATGGCTCAAC
GCCACCGAGC TGGAGATCCT CGCCGCCGAA CTCGCGCTCG CCGATGAGAC CCTGGCCCTG
AGCATGCTGC GCGCGCCCAG CGAGGACTTC GTCGGCTTCG ACTTCGGACG CACGGTCGAG
GCCGGCGAGG CCAGCCTGCG CATCCGCTAC CGCGGCCGCC TCCGCGACCG CGCCCAGAGC
GGTGTGTTCC GGCGCGAGGA CGAGGGCGCC CGCTACCTGT TCACGCAGTT CGAGCCGGTG
TCGGCGCGGC GCGCGTTCCC GTGCTTCGAC GAGCCCGGGT TCAAGGTGCC GTGGACGCTG
CGCATCGATG TGCCCGAGGG CGACCACGCG CTGACCAACG CGCCGCTGGT GAGCGAGCAG
GCCGGCCCAC GTCCCGGCAC CCGCAGCTTT CAGTTCGCGA CCACCGAGCC GCTGCCGAGC
TATCTCATCG CGTTCGCCGT CGGTCCCTTC GAGTTCGTCG ATCTCGGGAC GATCGGCCGC
GCCCAGACGC CGGCGCGCAT CGTGGTGCCG CGCGGCCGCA GCGCCCAGGT GCGCTACGCG
GCCGAGGTCA CGCCGCGGGT GCTGGACCTG CTCGAGGCGT ATTTCGACCA GCCGTACCCC
TACGCCAAGC TCGACAGCGT GGCGGTGCCG CAGCTCGGCT TCGCCATGGA GCATCCGGGC
CTCATCACCT ACGACGACAA GCGCATCCTG GCGCCGCCCA GTCAGGAGTC GGACGACTTC
CGCCGGCGCT ACGTCAGCGT CGCCCTGCAC GAGATCGCGC ACCAGTGGTT CGGCAATCTG
GTGACCATGC GCTGGTGGGA CGACCTGTGG CTCAACGAGT CCTTCGCCAC CTGGATGGCG
ACCAAGCAGC TCCCGGCCTT CGAGCCCGCC TGGCACAGCG GTCTGCCCGC GCTCGAGCGC
GCGCATCTGG CCTTGGCCGC CGACGAGCTG TTGGCCGCGC GCCGCGTGCG CGAGCCGGTC
ACCAGCACCA ACGACATCTT CAGCTCGTTC AACGCGATGA CCTACAGCAA GGGCAACGCC
ATCCTCACCA TGTTCGAGGG CTGGATCGGG CCCGAGCGCT TCCGCGCCGG CGTGCGCGCC
TATCTGCGCG AGCACGCGCA CGGCAACGCC GCGCTCGAGG ATTTTCTCGC CGCCCTGGCC
GCGGCCAGCG ATGAGCGCGC CAGCGCCGCG GTGCGCAGCT TCCTCGACCA GTCCGGGGCG
CCCGTGATCT CGGCCGAGGT GGTGTGCGAG CCCGGCGAGC CCCTGGTCAT CGAGCTGGCC
CAGGAGCATT TCCTGCCCAT GGGCTCCGAG GGCTCGACCA CCGATCGCGT GTGGCAGGTG
CCGGTGTGCG TGAAATACGG GCAGCGCACG GGCAAGCCGC GCGAGCAGTG CGTGTTGCTC
GACGACGAGG TCGGCACGAT CGAGATCCAG GGGCCGCGCC GCTGTCCGGC CTGGGTGCTG
GGCAACGCCG ATGCCAAGGG CTATTACCGG ACCTCGTACG GGCTCGGCGC GATCTCGCTG
CTGCTCGGGC CGGTGCGCGG CCAGCTCTCC GAAGCCGAGC GCGCGGGTGT GGCCGAGGAC
CTGCGCGCGC AGATCGCGGC CGGCGGCATC GAGATGGGCG AGGCGTTGGC GCTGGTGCCG
CGCCTGTTCG CCGATCCCAG CGAGCGCGTG GCCCGCAGCG CGCTCGAGCT GGTCGCCGGC
GCCCACCTGC ACATGGTGCC GGCGGCGCTG GACGATGCCT ACGGCCGCTT CGTGCGCGCG
AGCTTCGCCA AGCGCGCCCG GCAGCTCGGC TGGCAGACGC GGCCCGATGA CCCGGTGGCG
CGGCGCGAGC TGCGCGCGCT GCTGCTGCCG CTGGTCGCGC TCGCGGGCGG CGACGCGGAG
CTGCGGGCCG AGGCGCAGGA GCTGGCGCGG GCCTGGCTGG CGCAGCTCGA CGCGCTCGAC
GGCGAACAGG TCGAGAGCCT GGGCCCGGTG CTGGCGGTGG CCGTGGCCAC GGGCGGCGAC
GAGCTGCGCG CCGAGCTGGT GCGCGCGCTG GCCGGCGCCA AGGACGCCGG GATGCGCCGA
CGTCTGATCT TCGGGCTGGC GCACGCGCGC GCGCCCGAGC AGGTGCGCGC CAACCTGGCG
CTGGTGCGCG CGGGCGAGAT CGGTGGCCAC GAGGCCGATG ATCTGGTGCT GGTGCCGCTG
CGCAATCCCG CGCTGCGCGC CGACGCCTAC GCGCTCATCG AGGAGCAGCG CGAGGCGCTG
TTGGAGGCGC TGCCGCGCGG CGATCGCCGG CTGCTGATCC ACGCCGCCGC CGCGTTCTGC
GATCAGGAGC ATCTCGACCA GCTCGCGGCC ATGGCCGAGG GCGCGGACGC GATCCTCGGT
GGTCCCCAGG CGGTGGCCGA GGACCGTGAG CGTGTCAAGC TGTGCATGGT TCAGCGAGCC
GCGCACCGGC CCAGCGTCGA GGCGTTTTTG AAACAGCAGT GA
 
Protein sequence
MNVQRSLVFS SCLSLALLSA ACGGRQAPAG VAAESPRAAV AAEGMPLGPE ATAPALRLEG 
DARPLGYELW LDLDPASEIF TGEVVIALAL DQPSERVWLN ATELEILAAE LALADETLAL
SMLRAPSEDF VGFDFGRTVE AGEASLRIRY RGRLRDRAQS GVFRREDEGA RYLFTQFEPV
SARRAFPCFD EPGFKVPWTL RIDVPEGDHA LTNAPLVSEQ AGPRPGTRSF QFATTEPLPS
YLIAFAVGPF EFVDLGTIGR AQTPARIVVP RGRSAQVRYA AEVTPRVLDL LEAYFDQPYP
YAKLDSVAVP QLGFAMEHPG LITYDDKRIL APPSQESDDF RRRYVSVALH EIAHQWFGNL
VTMRWWDDLW LNESFATWMA TKQLPAFEPA WHSGLPALER AHLALAADEL LAARRVREPV
TSTNDIFSSF NAMTYSKGNA ILTMFEGWIG PERFRAGVRA YLREHAHGNA ALEDFLAALA
AASDERASAA VRSFLDQSGA PVISAEVVCE PGEPLVIELA QEHFLPMGSE GSTTDRVWQV
PVCVKYGQRT GKPREQCVLL DDEVGTIEIQ GPRRCPAWVL GNADAKGYYR TSYGLGAISL
LLGPVRGQLS EAERAGVAED LRAQIAAGGI EMGEALALVP RLFADPSERV ARSALELVAG
AHLHMVPAAL DDAYGRFVRA SFAKRARQLG WQTRPDDPVA RRELRALLLP LVALAGGDAE
LRAEAQELAR AWLAQLDALD GEQVESLGPV LAVAVATGGD ELRAELVRAL AGAKDAGMRR
RLIFGLAHAR APEQVRANLA LVRAGEIGGH EADDLVLVPL RNPALRADAY ALIEEQREAL
LEALPRGDRR LLIHAAAAFC DQEHLDQLAA MAEGADAILG GPQAVAEDRE RVKLCMVQRA
AHRPSVEAFL KQQ