Gene Hoch_3433 details

Gene Information

Locus tag: Hoch_3433
Symbol:
ID: 8545821
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 4743766
End bp: 4745115
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 65%
IMG OID: 646388100
Product: transposase IS4 family protein
Protein accession: YP_003267828
Protein GI: 262196619
COG category:
COG ID:
TIGRFAM ID:
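
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function name `cds_lengths` is hypothetical and not part of the record or of any database API.

```python
# Coordinates copied from the record above.
START_BP = 4743766
END_BP = 4745115


def cds_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the last codon
    is a stop codon that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the terminal stop codon
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(START_BP, END_BP)
print(gene_len, protein_len)  # 1350 449
```

Under those assumptions the result matches the listed values of 1350 bp and 449 aa.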


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.168872
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0300834
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATCTGA GCACCGCCCT GCAATGTGTT GCGTCGTATC CTCCCCCGGA GGAGTTCTCT 
CGTCTTGCCC GCGATGTCGC GCCGGAATGG ATCGAGCAAG CGCTCGAGGC GACCGGGACG
GCGACCTTGC GCCGGCGCCG ATTACCGATG GAGCAGTTGG TCTGGCTGGT TATCGGCATG
GCCCTGTTCC GCGACCGTCC GATCACCGAG GTGGTCACCA GTCTGGACCT GGCGCTGCCG
AGCCCTGGCC ATCCTGAGGT AGCGCCGAGC GCGGTGGCGC AGGCCCGCGA CCGGCTGGGC
GAATCGCCTA TGGCGTGGCT GTTCGCCCAC AGCGCCGACC GATGGGCGCA TCAAAGCGCG
GCCGACGATA GATGGCGGGG GTTGGCGCTC TACGGGGTAG ATGGCACGAC GCTGCGGGTG
CCCGACAGCG AGGAGAATCG GGACCATTTC GGCCTGGCCA ACGGCGGCGC TCGCGGCAGC
AGCGGCTACC CTGTGGTTCG CCTGGCTGCG TTGATGGCGC TGCGCTCGCA TCTGCTGGCA
GCGGTGTCGT TTGGCCCATA TCAGGGCCAC GGCGAGTACT GGTACGCGGC GGATCTATGG
CCATGTTTGC CCGATAACTC GCTCGTCATC GTCGATCGAC ACTATTGGGC CGCCAACGTG
CTAATTCCGC TCCAGCAGGA CGGGTTGAAT CGGCACTGGC TCATCCGCGG GCGAAAAGGT
CTCAACTATC GTGTCGTCGA GCAGCTCGGG CCGAGCGACG AGTTGGCCGA GGTGAAGGTC
TCACCGCAGG CTCGGTCCAA GAACCCGGAG CTACCCCGGA CGTGGACGGT CCGAATCATC
CACTACCAGC GCAAAGGATT TCGACCACAG CGACTGTTTA CCTCACTGCT CGACCCGGTC
GCCTATCCGG CCGACGAGTT GGTTGCGCTC TACCACGAGC GTTGGGAGAT CGAACTCGGA
TACGACGAGG TGAAGTCCAA GATGCTCGCC AATGTCCCGT TGCGCAGCAA ATCCGTGGAC
CGAGCCCGCC AAGAGATCTG GGGGCTGCTC ATCGCCTACA ACCTCATTCG CCTCGAGATG
GCGCGAGTCG CCCACGAGGC TGGTGTGCCG CCCACGCGTA TCAGCTTCGT CACGGTCTTT
CGCCTCATCT GCGCCGAGTG GCTCTGGTGT AGTCACTCCA AGCCCGGCGC TATCCCCCGA
CATCTTCGGA ACCTGCGACG TAATATCCGT CGCTTCATCC TGCCGCCCCG CCGCACCGAA
CGCAGCTACC CGCGAGCCGT CAAGGTCAAG ATGAGCAGCT ACCCGCGGAA GCGACGTCCT
GCCCAGGCTC GGCCCGCGTC CGCCAAGTGA
 
Protein sequence
MHLSTALQCV ASYPPPEEFS RLARDVAPEW IEQALEATGT ATLRRRRLPM EQLVWLVIGM 
ALFRDRPITE VVTSLDLALP SPGHPEVAPS AVAQARDRLG ESPMAWLFAH SADRWAHQSA
ADDRWRGLAL YGVDGTTLRV PDSEENRDHF GLANGGARGS SGYPVVRLAA LMALRSHLLA
AVSFGPYQGH GEYWYAADLW PCLPDNSLVI VDRHYWAANV LIPLQQDGLN RHWLIRGRKG
LNYRVVEQLG PSDELAEVKV SPQARSKNPE LPRTWTVRII HYQRKGFRPQ RLFTSLLDPV
AYPADELVAL YHERWEIELG YDEVKSKMLA NVPLRSKSVD RARQEIWGLL IAYNLIRLEM
ARVAHEAGVP PTRISFVTVF RLICAEWLWC SHSKPGAIPR HLRNLRRNIR RFILPPRRTE
RSYPRAVKVK MSSYPRKRRP AQARPASAK
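
The sequences above are printed in space-separated blocks of ten letters, so they need to be joined before any downstream use. A minimal sketch of that cleanup, with a GC percentage and a length consistency check, is given below. The helper names (`clean_sequence`, `gc_percent`, `is_consistent`) are hypothetical, and the demo uses a short toy sequence rather than the record's own.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def is_consistent(dna: str, protein: str) -> bool:
    """True if the CDS is codon-aligned and codes for the protein plus one stop.

    Assumption: the final codon of the CDS is a stop codon that is not
    represented in the protein string.
    """
    return len(dna) % 3 == 0 and len(dna) // 3 == len(protein) + 1


# Toy input (not the record's sequence): two codons followed by a stop codon.
toy_dna = clean_sequence("ATGCAT TGA")
toy_protein = clean_sequence("MH")
print(len(toy_dna), is_consistent(toy_dna, toy_protein))  # 9 True
```

Pasting the full gene and protein sequences into the same helpers would let a reader check them against the listed gene length, protein length, and GC content.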