Gene Hoch_1413 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1413 
Symbol 
ID8543795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp1898893 
End bp1900311 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content62% 
IMG OID646386125 
Producttransposase IS4 family protein 
Protein accessionYP_003265860 
Protein GI262194651 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAATCGA GCAGAGCCAA CATCCACCGG CGATACCATA AGGTGCCGGC GCTACGATTC 
GCGGAGGATG GGCGCCTGAC CGCGTACGCA GGATTAGTGC TGGTCCAAGT GCTGATAGCA
GCACTTGGCC TGAAAAAGCG GCTACGGCGC TGCTTCAGCC ATCTCGGTAA AGACAGCATC
TATGGGATGG GTAAGATGGT GTTGCTGCTG CTCGTGGCCA TCCTGCTAGG ATGCCGGCGT
CTGAGAGACC TCGACTACTG TCGTGAAGAT CCGCTCCTGA AACGAGTCGT GGGCGTGAAG
CGGTTACCGG ACGTAGCGAC GATATCGCGC GCCCTGACAA AGATGGATGA GCGAGGTGTG
GAGGGAATGC GAAGCGAGGT GCGGGGGCTG GTGCTAGAGC GCTTGGAAGG AGAAGCGCAG
AGCCGGGTGA CGGTAGACTT CGATGGCTCG GTGCAGACGA CGCGAGGTCA CGCAGAGGGA
ACAGCGGTGG GGTACAACCC GCTCAAGAAA GGCGCTCGGA GCTACTATCC GCTGTTCTGC
ACAGTGGCGC AGACAGAGCA GTTTTTCGAC GTGCTGTTCC GCTCAGGCAA CGTGCACGAC
TCCAACGGGG CCAGTGGCTT CATGAGCGCA TGCCTCAGCG AGTTGCACGA GCGGCTGCCT
CGCGCGCAGC TCGAGACCCG GGTCGATAGC GCGTTTTTCA ACGAGCGGGT GCTCGCCACG
CTGCACGAGC GTGGAGTGGA GTTTAGCTGT TCGGTGCCGT TCGAGCGGTT TCCTGCGCTC
AAAGCGTTGG TGGAGGAGCA GCGGGAATGG CGTGCTCTGG ACGAGCGATA CTCCTACGCC
GAAGTCGCCT GGAAGCCGCA ATGTTGGGGC GTGAGGTATC GCATCCTGCT CGTGCGGCAG
CGCAAGAAGC CGCGTAGCCC TCGACCCATC CAGCTCGACC TGTTCGTTCC GTTCGACGAG
GTGTACGAGT ATACAGTGGT GGCGACCAAC AAGAAGGTCT CGCCTCGGGC GGTGCTCGGT
TTCCATCACG GGCGTGGCTC GCAGGAGAAG CTCTTTGGCG AGGCCAAGCA GCATGCCGCT
CTCGACGTGA TTCTGGGACG GCGTCAGAAG GCCAACCAGC TCTTCTCCCT CTGCGGCATG
CTCGCTCACA ATTTGTCCCG CGAGATGCAG ATGATGCGGT GGCCCAAAGA GCGCCCTACG
CAGCGCAAGC GCCCTGCCCA CTGGCGCTTC CACAGCCTTG GCACGCTACG GCAGCGATTG
TTCCATCGCG CGGGTCGCCT GCTTCGTCCG CAAGGCCATC TCACCCTCGA ACTCAACGCG
AATTCCGACG TGCGGTCCGA ATTTGAAGGC TACCTCGACG CCATGCTCCA CGGCGCTCGT
TTCGGTGCCG CCTCCGACAG TTCTGCCGCC CAAGCCTAG
 
Protein sequence
MKSSRANIHR RYHKVPALRF AEDGRLTAYA GLVLVQVLIA ALGLKKRLRR CFSHLGKDSI 
YGMGKMVLLL LVAILLGCRR LRDLDYCRED PLLKRVVGVK RLPDVATISR ALTKMDERGV
EGMRSEVRGL VLERLEGEAQ SRVTVDFDGS VQTTRGHAEG TAVGYNPLKK GARSYYPLFC
TVAQTEQFFD VLFRSGNVHD SNGASGFMSA CLSELHERLP RAQLETRVDS AFFNERVLAT
LHERGVEFSC SVPFERFPAL KALVEEQREW RALDERYSYA EVAWKPQCWG VRYRILLVRQ
RKKPRSPRPI QLDLFVPFDE VYEYTVVATN KKVSPRAVLG FHHGRGSQEK LFGEAKQHAA
LDVILGRRQK ANQLFSLCGM LAHNLSREMQ MMRWPKERPT QRKRPAHWRF HSLGTLRQRL
FHRAGRLLRP QGHLTLELNA NSDVRSEFEG YLDAMLHGAR FGAASDSSAA QA