Gene Hore_22220 details

Gene Information

Locus tag: Hore_22220
Symbol:
ID: 7313770
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 2422006
End bp: 2423430
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 35%
IMG OID: 643612674
Product: lipopolysaccharide biosynthesis protein
Protein accession: YP_002509962
Protein GI: 220933054
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily
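
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(2422006, 2423430)
print(length)                     # 1425
print(protein_length_aa(length))  # 474
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.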


Plasmid Coverage information

Num covering plasmid clones: 53
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAATA TACCCCATAA TTATGAGTAT GAGATTGATC TCACTGAATA CCTCGGAATT 
TTAAATAAAA GAAAATGGCT GATACTGGCT ATAACTATCC TGGCAGGTGT TATGGGTTAT
ATCCTTACCT CATACAAACC GGCGGTATTT CAGTCAGATG CCCTGTTAAT GATAGATGAA
ACTCCTACTA CTTTAAATCA AATTGAACTA TCCCCTTTTA ACAACAATAA CAGGGATCTT
ATCACTTACA GTAAATTGTT AAAAACTAGG AAACTCCTTA AAAAAGTCAG TCAACACTTT
GACGAAAATA AGGTTTCAGT TAAGTATTTA ATTACTAACT TAAACATTGA GATTATACCT
GATACCAGGT TAATTAAAAT TTCCATAAAA CATACAGACC CGGTAATAGC CCGGCAGATT
ATATCATATC TAATAGAAGA ATTTATTATA AATAATAGGA ATTTAAAAAA ATCAGCAACG
GTTAATGCCC GAAACTATGT AGCCCGGCAA CTGGAGAAGG TCTCCAGGGA TTTAAAAAAT
ATTGAGGCAG AAGTCAGACA TTATAAAGAA GAAAACAAAA GCCTTGTTCT CTCTAATTTT
ACCCAGAAAA TTATGCAATC AATGCTGGAC CTGGAAGAAA ATCTGGCAGA AGTGGAAATC
AATATAAAAA CACATCGGGC TGCCCTGAAC CACTTATATA ACAAATTACA GTCGTCCCGG
GAGCTTATAT TATCATCTAA AACCCTTTCC AAAAACCCTG CTTATACAGA ACTTAAAAAT
AGACTGACAG AACTGGAAAT CGAGCTAGAC TCTTTGACAA CAGTCTATAC GGACAAACAC
CCTGAGATTA TTAAATTAAA AGCTGAGAAA AAATCAATCC TTAACGAAAT TTCTAATACT
CTTGGTGAAG TTATAACCTC AACCATCTAT ACGACAAACC CTGTTTATAA TAATCTAAAA
CAGGAATTAG TCAAACTGGA AACAGAGTTA ACCTCCCTGA AAGCGCAAAA AGAGTCTCTG
AATATCCAGT TTCAAAAGAT TAAAGCAAAA ACAGAGCAGT TACCCAAAAA AGAACTGGAG
TACTCAAGGT TACTCAGGAA ACTGGAGGTT TCTGAAAAAC TATATACCAT GCTTTTGACC
AGATACCAGG AATTAAAGAT AACTGAGGCC ATGAAGGTCT CTGATATTAT TACAGTCGAT
CCCCCGGTCG TGCCCGAAAG CCCGGTCGGA CCCAATATGA AATTAAACCT TGCCATTGCC
ATTATAATGG GTCTCTTTGT TGGTGTCTTT ATAGCTTTTA TTCTTGAATT TATAAATAAT
ACTATTCAGC GGGTTGAAGA AATTGAGGAA ATAACAGATG TACCAATAAT AGGTTATATT
CCTTATATAG ACAAAAAAAA TGATGGACGT GATCATAATA ATTAA
 
Protein sequence
MDNIPHNYEY EIDLTEYLGI LNKRKWLILA ITILAGVMGY ILTSYKPAVF QSDALLMIDE 
TPTTLNQIEL SPFNNNNRDL ITYSKLLKTR KLLKKVSQHF DENKVSVKYL ITNLNIEIIP
DTRLIKISIK HTDPVIARQI ISYLIEEFII NNRNLKKSAT VNARNYVARQ LEKVSRDLKN
IEAEVRHYKE ENKSLVLSNF TQKIMQSMLD LEENLAEVEI NIKTHRAALN HLYNKLQSSR
ELILSSKTLS KNPAYTELKN RLTELEIELD SLTTVYTDKH PEIIKLKAEK KSILNEISNT
LGEVITSTIY TTNPVYNNLK QELVKLETEL TSLKAQKESL NIQFQKIKAK TEQLPKKELE
YSRLLRKLEV SEKLYTMLLT RYQELKITEA MKVSDIITVD PPVVPESPVG PNMKLNLAIA
IIMGLFVGVF IAFILEFINN TIQRVEEIEE ITDVPIIGYI PYIDKKNDGR DHNN
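
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: it assumes that translation table 11 assigns amino acids to codons as the standard genetic code does (the two tables differ in permitted start codons, which this sketch ignores), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments of the standard code,
# assumed here to apply to translation table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, as formatted in the record.
print(translate("ATGGATAATA TACCCCATAA TTATGAGTAT"))  # MDNIPHNYEY
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_percent` can be compared against the GC content field.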