Gene Hhal_1076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1076 
Symbol 
ID4709868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1170499 
End bp1171833 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content68% 
IMG OID639855547 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001002654 
Protein GI121997867 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0571583 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCCGCT GGCGCGACGG ACTCACCCAG GTACGACGCT ACCCTTCGGC CGTGGTGGGG 
CTGCTGATCA TCGCCATGCT GCTCGCCACC GCGGTCTACG CCGTCACCGC CATCCCCTAC
TCCGAGGCGC AGAAGCTCTG GCGCGGCGGC GATGCCTGGC AGGATCTGCC GGTCAACGCC
AGCCCGGCAT GGACCGATCG GCTCTTCGGT GGCGAGGCGC CGCGCACCAT CACTGCCTCC
AGCGACGACG CGCCCACCGA GACCCAGGAC CTGGGCGCGG CCCGGCTCGA GACCACCACC
CTCAAGGTGG ACTTTCCGTA CGACGATTTC CCTTCGGAGA TCAACCTGTT CCTCGAGGCG
GAGTTCGAGG AGGGCCAGCC CTTTGCACGG GTCTACTGGC GCACCCCGGA CGGCCAGGAG
ATCCCCATCG ACGCCCGGCG GATCGGCGAG CGGGAACGCT TCTCGATCTC CCAGGACCGG
CAGCTGGAAA GCCGGCTGGG CTTCCCGCCC CAGGTCGGCC TGTTCACCGA CGCCGAGCCC
GGCACGCCGA ACCCGGTGGT GCGCGGCGGC ACCTACGAGT TGATCATCGA GACGGTGCAC
TTTGAGGAGT TGGCCCGGAT GCACGCTGAC CTGGTCGTCT ACGGGCAGGT CCACGGCGCC
TTCGGCACCG ACCACCAGCG CCGCGATCTG GCTGTCGCCC TGCTCTGGGG CACACCGGTG
GCGCTGGCCT TTGGGCTGCT GGCCGCCGTC GGCACCACGA TCACCACCCT GATCATCGCC
GCCACCGGGG TCTGGTACGG CGGCTGGGTA GACGCCACCA TCCAGCGGCT CACCGAGGTG
AACATCATCC TGCCGCTGCT GCCCATCCTG GTGATGATCG GCACCCTCTA CTCGACCAGC
ATCTGGCTGA TGCTCGGCGT GGTGGTGGTG CTGGGCATAT TCAGCGCCGG GATCAAGATG
TACCGCTCCA TGCTCCTGCC CATCCGCCAG GCACCCTACA TCGAGGCAGC CCGCGCTTAC
GGGGCCAGCG GCGGGCGGAT CATCCTGCGC TACATGGTGC CGCGGATCCT GCCGGTACTG
ATCCCGACCT TCGTGACCCT GATCCCCACG TACGTCTTCC TGGAGGCGTC GCTGGCGGTC
CTCGGCCTGG GTGACCCGGT GCTACCGACC TGGGGCAAGG TCCTCCACGA CGCCCAGGCC
CAGAGCGCGC TCTACCACGG GTTCTACTAC TGGGTGCTGT CACCGGCGGC GCTCTTGATG
CTGACCGGGC TCGGCTTTGC CATGCTCGGC TTTGCCCTGG ACCGGATCTT CAACCCGCGG
CTGAGGAGCA TCTGA
 
Protein sequence
MIRWRDGLTQ VRRYPSAVVG LLIIAMLLAT AVYAVTAIPY SEAQKLWRGG DAWQDLPVNA 
SPAWTDRLFG GEAPRTITAS SDDAPTETQD LGAARLETTT LKVDFPYDDF PSEINLFLEA
EFEEGQPFAR VYWRTPDGQE IPIDARRIGE RERFSISQDR QLESRLGFPP QVGLFTDAEP
GTPNPVVRGG TYELIIETVH FEELARMHAD LVVYGQVHGA FGTDHQRRDL AVALLWGTPV
ALAFGLLAAV GTTITTLIIA ATGVWYGGWV DATIQRLTEV NIILPLLPIL VMIGTLYSTS
IWLMLGVVVV LGIFSAGIKM YRSMLLPIRQ APYIEAARAY GASGGRIILR YMVPRILPVL
IPTFVTLIPT YVFLEASLAV LGLGDPVLPT WGKVLHDAQA QSALYHGFYY WVLSPAALLM
LTGLGFAMLG FALDRIFNPR LRSI