Gene Hoch_1270 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1270 
Symbol 
ID8543652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp1672348 
End bp1674150 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content70% 
IMG OID646385988 
ProductCollagen triple helix repeat protein 
Protein accessionYP_003265723 
Protein GI262194514 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.932295 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000745074 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGACAG CAACGTGGAA GTTAGCCTGC CAGATGGCCT GCATCGCGAC GCTCGTGTTC 
ACCGGCTGCA CGGGCGATCG CGGCCCGCAG GGCGCCGACG GTCCGCCCGG AACCCAGGGC
CCGGACGGCA GCGACGGCGC AAACGGCAAT GACGGCAACG ACGGCGCGAA CGGCGACGAC
GGCGCGGACG GCGCGGACGG CGCGGACGGC GTCACCGCGA TCGCGCTGCG CCTGCTCGGC
CGCTACGAGA GCGGCATCTT CGACCAGGGG GCCTCGGAGA TCGTCAGCTA CGACCCGGCC
ACGACGCAGC TCTTCCAGGT CAACGCCAAC TCCGGCGCGC TCGACGTCCT GTCCCTGGTC
GATCCCGCGG CGCCGACGCT GCTCAGCAGC ATCGACGTGG CCGCCGCCAT CGCCGACAAC
ACCGATATCA CCACCGTCCT CGGCGCGGTC AACAGCGTCG ATGTCCGCGG CGGCGTGGTC
GCGGCCGTTA TCGCCGCCGA CAGCGGCGAC GAGCGCGGCG CCATCGCGTT CTTCCGCGCC
GCCGATCACG CGTTCCTGGC CGGCTACGAG CTGGGATTTG GCCCCGACTC GCTCGCGTTC
AGCCCCGACG GCGACACCGT GATCGTGGCC AACGAAGGCG AGCCCCTCGA CGACTACACC
GTCGATCCGC CGGGCTCGGT GAGCGTCATC GATCTCAGCG TCGGCGTCGC CGCGGCGACC
ATTCGCGATC TCGATTTTAC CGCGTTTGAC GCCGGCGGAA CGCGCGCAGG CGAACTCGAC
CCCGCGGTCC GCATCTTCGG CATCAAGCAG CCGGGCGACG TCCCATCCAC GGTGTCCGAA
GACATCGAAC CCGAGTACGT CGCCTTCGCG CCCGACGGCG CCACCGCCTT CGTCAGCCTG
CAGGAGAACA ACGCCATCGC CGTCATCGAG GTGGCCGCGC CGCGCATCGC CCGCATCTTC
CCGGCCGGCG CCACCGACCA CGGCCGCATC GGCAACGAGC TCGATCCCAG CGATCGCGAC
CGCGGCATCG AGATCCGCAA CTGGCCGGTG TCGGGCCTGC GTTTGCCCGA CTCCATCGCC
ACCTACGATT ACCAGGGCCG GACCCTGCTG GTCACCGCCA ACGAGGGCGA CACCCGCGAC
TACGGCGGCT TCTCCGAGGA GGAGCGGATC CGCGACCTGG TGCTCGATCC CGAGGCATTC
CCCGACGCCG CCGCGCTGCA GAGCGACGCC CAGATCGGCC GCCTGCTCAC CACCTCGAGC
GCTGGCGACG ACGACGGCGA CGGTGATTTC GACCGCCTGT TCGCCATCGG CTCGCGCTCG
TTCTCGATCT ACACCGCGGA CGGCGCGCCG ATCTTCGACA GCGGCAACCA GTTCGAGCTG
ATCACGGCCT TCCGCCTCGA AGACCACTTC AACGCCAGCA ACGACGACAA CGAGGGCGAC
AGCCGCAGCG ACGCCAAAGG CTGCGAGCCC GAGGCGCTCA CCGTCGGCCG GGTGCGCAAC
GCCATGTTCG CGTTCATCGG CCTCGAGCGC ACCGGCGGCA TCATGGTCTA CAACATCTCC
AACCCGCACA GCCCCCGCTT CGTGCAGTAC GTCAACGACC GCAATTTCGC CGAAGAACCC
AGCCTGGGCG ACACCGACGG CGACGGCGTC GAGGAGAGCA ACCCGGCCGC CGGCGACCTC
GGCCCCGAGA GCATCCGCTT CATCCCGGCC GCCGACAGCC CCAACGGCAG CGCTCTGCTG
ATCGTGGGCA ACGAGGTCAG CGGCACCACG ACCGTCTACG CCATCGACAT CGTCCCCGAA
TGA
 
Protein sequence
MATATWKLAC QMACIATLVF TGCTGDRGPQ GADGPPGTQG PDGSDGANGN DGNDGANGDD 
GADGADGADG VTAIALRLLG RYESGIFDQG ASEIVSYDPA TTQLFQVNAN SGALDVLSLV
DPAAPTLLSS IDVAAAIADN TDITTVLGAV NSVDVRGGVV AAVIAADSGD ERGAIAFFRA
ADHAFLAGYE LGFGPDSLAF SPDGDTVIVA NEGEPLDDYT VDPPGSVSVI DLSVGVAAAT
IRDLDFTAFD AGGTRAGELD PAVRIFGIKQ PGDVPSTVSE DIEPEYVAFA PDGATAFVSL
QENNAIAVIE VAAPRIARIF PAGATDHGRI GNELDPSDRD RGIEIRNWPV SGLRLPDSIA
TYDYQGRTLL VTANEGDTRD YGGFSEEERI RDLVLDPEAF PDAAALQSDA QIGRLLTTSS
AGDDDGDGDF DRLFAIGSRS FSIYTADGAP IFDSGNQFEL ITAFRLEDHF NASNDDNEGD
SRSDAKGCEP EALTVGRVRN AMFAFIGLER TGGIMVYNIS NPHSPRFVQY VNDRNFAEEP
SLGDTDGDGV EESNPAAGDL GPESIRFIPA ADSPNGSALL IVGNEVSGTT TVYAIDIVPE