Gene Hoch_3197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3197 
Symbol 
ID8545585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4406845 
End bp4408038 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content69% 
IMG OID646387864 
Producthypothetical protein 
Protein accessionYP_003267592 
Protein GI262196383 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.111821 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTGGT TTCGTTTCCC CCTCGTGGCG TCGCTGTTGC TGCCGCCGCT GCTGTTCGCG 
AGCGCGCAGC CGGCCGCGGG CCAGCCGGCC GCCTTTCCGC TCGAGGATGA CGCCGCGGTG
GCGCTGCCGC CGCTGCCCGG AATTCCGCTG CTGCCGAGCA CGGCCGTGGT GGCGCTGCCG
CCGCAGCCGC AGCCGCCACC GGCGCCGGTG AGCCTCGCGC TCGCGGCCGA GGGCGAGGCC
AACGTGACCA CGGCCAGCGC GGAGACGAGC GCGGACTTCG ATGACGATGT CGAAGGCGAG
GGAGAAGGCG AGGGCGAAAG CGGCCTCACG ACTGGCAAGA TCGTGAGCGC CAGCATCATC
GGCGGCATCC ACGCGACGCT GTACACCTGG GCGTATTTCG CCTGGTATCG GCCGCGCACC
AAATACGACG AGCTGACCTT TATCGACGAG GGCTGGTTCG GTCCCGGCAC CTACGCGGGC
GGCGCCGACA AACTCGGCCA CTTCTACTCC AATTACCTGT TCGTGCGCGG CACCGTGGGT
GTGCTCGAGG CCGGCGGCTG GGAGCGCAAG TGGGCGCTGC CGGCCTCGCT GGCGCTCACG
CTCAGCTTCT TCACCGCCAT CGAGATCAAG GACGGCTACC ACAAGGGATT TGGCTTCTCG
CTCCAGGACA TCACCGCCAA CCTCTCGGGC AACGCGCTGG CCGCGCTGCT CTTGGCGGTG
CCCGCCATCG ACCGCGCCAT CGACCTGCGC ATCGAGTATT TGCCCAGCAA GGCGTTCCGC
GACGAGCTGC GCATGGGCGG GGTCGACGCG GCCGAAGACT ATACCGGACA GTCGTTCGTA
CTCGCCTTTC ACCTGGGCTC GATCGAGCCG CTGCGGCGCT CGCGCTACCT GGGCTGGACG
CAGTACGTCG ACGTGGTCGG CGGCTACCAG GCGCGCAACT ACAAGCCCGC GCCCGCCGAC
CCCAGCGCCG AGCTGCCGAC GCAGGAGCTG TATTTCGGAC TAGCGCTCGA CATGCAGGCC
GTGATGCGCG CCTGGGACCG CTCGGTGTCT TCGCCCGGGT GGTCGAACGC CATCGACACC
ACGCGCGCGA TCTTCGAGTT CGTCCAGGTG CCCTACACCA CGCTCGAGCT GGTCGACGCC
GAGCGCGTCA ATCCTCCGGT GACGGACGAG TCCAGCGCCG GACTGCGCTG GTAA
 
Protein sequence
MSWFRFPLVA SLLLPPLLFA SAQPAAGQPA AFPLEDDAAV ALPPLPGIPL LPSTAVVALP 
PQPQPPPAPV SLALAAEGEA NVTTASAETS ADFDDDVEGE GEGEGESGLT TGKIVSASII
GGIHATLYTW AYFAWYRPRT KYDELTFIDE GWFGPGTYAG GADKLGHFYS NYLFVRGTVG
VLEAGGWERK WALPASLALT LSFFTAIEIK DGYHKGFGFS LQDITANLSG NALAALLLAV
PAIDRAIDLR IEYLPSKAFR DELRMGGVDA AEDYTGQSFV LAFHLGSIEP LRRSRYLGWT
QYVDVVGGYQ ARNYKPAPAD PSAELPTQEL YFGLALDMQA VMRAWDRSVS SPGWSNAIDT
TRAIFEFVQV PYTTLELVDA ERVNPPVTDE SSAGLRW