Gene Hhal_1831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1831 
Symbol 
ID4709273 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2002414 
End bp2003739 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content70% 
IMG OID639856302 
Producttetratricopeptide TPR_4 
Protein accessionYP_001003397 
Protein GI121998610 
COG category[S] Function unknown 
COG ID[COG3014] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGGTCTG GACTGCGTTG GTCCGGGCTG GCGGTGGCGC TGGCCATCGC GTTGGCCGGG 
TGCACGACGT ACGGTGACCG AATGGGCCCG GTGGAGGCCG CGGTCGACGA GGGGGACCCG
CGCCGGGCAG TGGAACTCCT CGATGAGCGC AGCGGCGGCT CCGGGGACCG GGTGGTGGAC
CTGCTCAACC GGGGCGCGCT GCTGCGCATG GCCGGCGAGT TCGAGGCGAG CAATCGTGCC
CTGGAGGCGG CCTATGACGC CATTGCCGAG GTCGACCCGC TGAGTGTCTC CGAGAGCGTT
GGAAGCCTGA TGGTGGGCGA GACGGTCTTT GCCTATGCCG GCGAGCCCCA CGAACGGGTG
CTGCTCCATC TGCTCATGGC CTTCAATTAT CTGGATCTGG GTGACCCCGA CGCCGCCCGG
GTGGAGGCCC TGCGGGTGGA TCTGCGTCTG CAGCGCCTGG CTGCCGAGCA GGGCCGGTCC
GTCTACCGGC AGGACCCCTT CGCGCGGTAT CTCAGCGGGC TGATCTTCGA GCGTCTCGGC
GAGCCGGATC AGGCCCTGGT GGCGTATCGC CAGGCCTACC AGGCGTACCG GGAGCAGGGC
GGCCGCCTGG GCGTCGCCGT GCCCCAGGCG CTGCACCGGG ATCTGCTGCG TCTGGCTGAT
GAACTGGGTC TCGACGACGA CCGGGCCGAG TGGCAGGAGG CGTTCGGGAA GGACACCTGG
CCGGATCCGG CCGCGCATCG CGAGCAGGCC CATGTGGTGA TCGTCGCCGG GGTCGGATTG
GCGCCGCGTA AGGGCGAACA AGGGTTCGCG GTGCAGGACC ATCAGGGGCG CATCCACAGC
GTGGCCCTGC CGTATTACGA GCCGCGGCAG CGGCCGGTTG GGGGCATCCG GGTCAGCAGC
GAGGCGGCCT CGGTCGTGGC AGAGCCGGTG CACGATATCG ATGCGGTCGC CCGTATCCTG
CTCCAGGAGC AGCAGGCCGC GCTGGCTGCC CGGGCGCTGG GCCGGCTGTT GGTGCAGAAG
GAGATGATCG ATCAGGCCCG GGAAGCCAGT CCGGTAGCCG GCCTGGCTAT GAACATCTTT
ACCCTGGTCG CCGACCGCGC AGATACCCGG ACCTGGGGGA TGCTGCCGGC GCAGTACTAC
ATGGCCCGGA TCAGCCTGCC GCCCGGTGAG CACCGGCTCG AGCTGGTCTA TCAGGGGCGC
TCCGGGCACG CCCTGACCCG GGTGGATCTG GGGCCTCTGG AGCTTGAGGC CGGGGAGTAC
CACTTTGTCT TTGATCGCTG GGTTTCGGCG CATGCTGGCT CCGTAACTCG CAGGGAGGAG
CCGTGA
 
Protein sequence
MRSGLRWSGL AVALAIALAG CTTYGDRMGP VEAAVDEGDP RRAVELLDER SGGSGDRVVD 
LLNRGALLRM AGEFEASNRA LEAAYDAIAE VDPLSVSESV GSLMVGETVF AYAGEPHERV
LLHLLMAFNY LDLGDPDAAR VEALRVDLRL QRLAAEQGRS VYRQDPFARY LSGLIFERLG
EPDQALVAYR QAYQAYREQG GRLGVAVPQA LHRDLLRLAD ELGLDDDRAE WQEAFGKDTW
PDPAAHREQA HVVIVAGVGL APRKGEQGFA VQDHQGRIHS VALPYYEPRQ RPVGGIRVSS
EAASVVAEPV HDIDAVARIL LQEQQAALAA RALGRLLVQK EMIDQAREAS PVAGLAMNIF
TLVADRADTR TWGMLPAQYY MARISLPPGE HRLELVYQGR SGHALTRVDL GPLELEAGEY
HFVFDRWVSA HAGSVTRREE P