Gene Hoch_0238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_0238 
Symbol 
ID8542617 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp355404 
End bp356873 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content70% 
IMG OID646385034 
ProductNa+/solute symporter 
Protein accessionYP_003264772 
Protein GI262193563 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.758968 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCGCT GGACCATCGT CACCCTCATC TGCGGCGCCT ACCTGGTCGC CAGCCTGGTG 
ATGGGCGTGC TGCCGGGGCG CAAGGTGACC AGCGGCGTCT CCGGCTTCGT CGCCGGCGAC
CGGGCCATGA ACGTCGTGGT CCTGTACTTC GTCATGGGCG CCTCGGTGTT CTCGTCCTTC
GCCTTCCTGG GCGGCCCCGG CTGGGCCTAC GGGCGCGGCG CCGCGGCCTT CTACATCCTC
GCCTACGGCG TCGTCGGCAT GGTGCCGCTG TACTTCATGG GCCCGCGCGC GCGCCGCCTG
GGCAGCCGCT TCGGCTTCGT CACGCAGGCC GAATTGCTGG CCCACCGCTA CCAGAGCCGG
GCCATCTCGG TGCTGCTCGC GGTGCTCAGC ATCGCCGCCT TCGTGCCCTA CCTGACCCTG
CAGATGAAGG GCGCCGGCTA CATCGTGTCG GTCATCTCCG AGGGCCGCAT CGAGCCCTGG
GTCGGCGCCG CCGTGGCCTA CGCCGTGGTG CTCGTGTACG TGCTCGCCAG CGGCGTCATG
GGCGTGGGCT GGACCAACAC CTTCCAGGGC ATCTTCATGA TGGCGGTGGC GTGGTTTCTG
GGCCTGTACT TGCCCGAGCG CCTGCACGGC GGCATCGGCG AGATGTTCGC CGCCATCGAG
GCCAGCGAGC GCGCCGCCAT GCTCACCGCG CCCGGCCTCG ACGCCGCCGG CCAGCCGTGG
AGCTGGGCCA CCTACAGCTC GGCCGTGGCC ATCTCGGCGC TGGGCTTCTG CATGTGGCCG
CATCTGTTCA TGAAGAGCTA CGCGGCCAAG AGCGAGCGCG CGCTGCGCCT GAGCGCGGTG
CTGTACACGA CCTTTCAGGC CTTTCTCATC CCCATCCTGT TCATCGGCTT CGCCGGCGTG
CTGGCCTTCC CCGGCGTGAG CCCGAGCGAC ACCATCCTGC CGCACATCCT CACCCAGGTG
GATCTCTCGC CCGTGCTCGT CGGCCTGGTG TGCGCGGGCA CGCTGGCGGC CTCGATGTCC
TCGGGCGACG CCATCTTGCA CGCGGCCGCC TCCATCGGCG TGCGCGACGG CCTGCGCCCG
TTCATCGGCG CGCGCCTCAG CGACCGCGGC GAGGCCAACG CCATCCGCGC GCTCATCCTC
GTCATCGCCG GCGTGGCCTA CGTCTTCGCC GTGGTGGTCG ACGTCTCCAT CGTCGCCCTC
TTGCTCGGCG CATACGGCGG CGTCGCCCAG ATCTTCCCGC CCATGTTCGC GGCCTTTTAC
TGGCCGCGGG CCACGCGCGC GGGCGCGGTG GCCGGACTGC TCGGCGGCCT GATCACCAGC
ACCCTCTTCC TGGTCATGCC CGAGTGGCGG CCCTGGCCCA TCCACGAAGG CGCCTACGGC
CTGCTCGTCA ACCTCCTGCT GCTGGTGAGC GTCAGCCTGG CGACCCCGCC CTTGCCGATC
GCGCACCTGC GCGCGTACAT GTCCCGTTGA
 
Protein sequence
MERWTIVTLI CGAYLVASLV MGVLPGRKVT SGVSGFVAGD RAMNVVVLYF VMGASVFSSF 
AFLGGPGWAY GRGAAAFYIL AYGVVGMVPL YFMGPRARRL GSRFGFVTQA ELLAHRYQSR
AISVLLAVLS IAAFVPYLTL QMKGAGYIVS VISEGRIEPW VGAAVAYAVV LVYVLASGVM
GVGWTNTFQG IFMMAVAWFL GLYLPERLHG GIGEMFAAIE ASERAAMLTA PGLDAAGQPW
SWATYSSAVA ISALGFCMWP HLFMKSYAAK SERALRLSAV LYTTFQAFLI PILFIGFAGV
LAFPGVSPSD TILPHILTQV DLSPVLVGLV CAGTLAASMS SGDAILHAAA SIGVRDGLRP
FIGARLSDRG EANAIRALIL VIAGVAYVFA VVVDVSIVAL LLGAYGGVAQ IFPPMFAAFY
WPRATRAGAV AGLLGGLITS TLFLVMPEWR PWPIHEGAYG LLVNLLLLVS VSLATPPLPI
AHLRAYMSR