Gene Hoch_5303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5303 
Symbol 
ID8547715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7293914 
End bp7295164 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content71% 
IMG OID646389977 
Producthypothetical protein 
Protein accessionYP_003269681 
Protein GI262198472 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCTCGG CTCGCATGCA ACGCGCCCCC GCGCTCGCCC GCGTGCTCCG CGCGGGCGTC 
TTCACCGCCG CCGCCGCGCT CGGCGCGCTC GCGCTCGCGC TCGCGCCGCT GCCCCCCGGC
GCGCCCGCCC TGCTGGCAGA CGCGGCCGCG CAGTCGCAGC CGCCCTCGTC GGCGTCGGGG
TCGGCGCCAG CGCCCGACGA GCGCGACCCG GCCCCGGTCC CCGGCGACGA CAGCGAACGC
GCGGACGCCG CGATCGCGGC CGCAGCGGAC AAGGACACCG ACTTCGCGGA CGACGCAGAC
GACGCTGACG ACGCAGACGA CGCAGGCGAC GACGACACCG ATGCGCCTGT CGAAGATGGC
GATCGGCCGT GGGCCGAGGG AGTGGCCGAG GCCGACCAGG AGCGCGCGCG TGCGCTCTAC
GAGGAAGGCA ACGGGCTGAT GCGCGAGTAT CTGCTCGAGG AGGCGATGGA GAAGTATCGC
CAGGCGCTCG CGCACTGGGA TCATCCGGCG GTTCACTACA ACCTGGCGCG CGTGCTCGAG
AGTTTGAACC ACGCCGATGA GGCCGACTTC CACATGGAGT ACGCCCTGCG CTACGGCGCG
GCCGCGTTCT CGGCGCAGCA GTATCCCCAG GTGCTCAACT TCCGGCGCGT GCTCGACCGC
AAACTCGGAC ACCTGTCCTT GTTCTCTGAC GAGCGCGGCA TCGAGGTGCT GGTCGACGGC
ACCGTGGTGC ACGCCGGCAT CGGTCAGGTG ACGCTGCGGC TGCTGCCGGG CGACCACGTC
ATCACGGTGC GCAGCGACGA GGTCGCGCCG ACCACGCATC GCGTACACCT CGATCCCGGC
GAACGCGTGC AGGTGACGCT GGCGACGCGC GTGCGCTGGC GCACCTGGGA ACCGTGGATG
GTGCTCGGTC TCGGCGGTCT GGTGGCCACC AGCGGCGGCC TGATGCAGTG GGCCGCGTTC
GAGAACAACG CGCGCTTTCG CGAGCGTTTC GCGGCCGAGT GCAACAGCGG CTGCAACGAC
GAGAACAACG CACAGCTCGC GGCGCTGCGC GGACGCGCGC ACTGGCAAAA TCGCGTCGCC
GTCGGCGCGA TGCTCACGGG CAGCGCGGTC ATCATCGCCG GCTCGCTGAT GCACGCGCTC
AACCAGTCGC GCTTCGCCGA GATCGATGTC GGGCAACGCG ACAACGCGCT CACCGTGCTA
CCCTCGGTCC ATTCGGATGG CGCCGGTTTT GCCGTCCATC TGTCCTTTTA G
 
Protein sequence
MISARMQRAP ALARVLRAGV FTAAAALGAL ALALAPLPPG APALLADAAA QSQPPSSASG 
SAPAPDERDP APVPGDDSER ADAAIAAAAD KDTDFADDAD DADDADDAGD DDTDAPVEDG
DRPWAEGVAE ADQERARALY EEGNGLMREY LLEEAMEKYR QALAHWDHPA VHYNLARVLE
SLNHADEADF HMEYALRYGA AAFSAQQYPQ VLNFRRVLDR KLGHLSLFSD ERGIEVLVDG
TVVHAGIGQV TLRLLPGDHV ITVRSDEVAP TTHRVHLDPG ERVQVTLATR VRWRTWEPWM
VLGLGGLVAT SGGLMQWAAF ENNARFRERF AAECNSGCND ENNAQLAALR GRAHWQNRVA
VGAMLTGSAV IIAGSLMHAL NQSRFAEIDV GQRDNALTVL PSVHSDGAGF AVHLSF