Gene Hhal_0234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0234 
Symbol 
ID4711090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp268798 
End bp269928 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content67% 
IMG OID639854694 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001001830 
Protein GI121997043 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4176] ABC-type proline/glycine betaine transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.197638 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGATC AACCGGATCT CGAGGAACAC GAAACCATCG GCCCGCTGGA CCAGGCGGCC 
GAGTGGTTCA GTGAGAACAT CCTGGACAAC ATCACCATCG GCGACTGGAT CGAGGACGGC
GTCGACTGGA TCAGCGACAA CCTGGAGCCC CTGCTCGACG GCATCGAGGG CGCCATCCGC
GCTCTGGTCG ACAGCACCGA GTTCCTCCTC CTCTACCCGC TGTGGATCGC CGCCTTCTTC
CTGGTGGTCG GCGCCTGGCG CACCTGGGGC CGCAAGGCCG GACTGATCAG CCTCGCCGTG
GCGGTGGCGC TGTTCGGCAT GGGCCTGTTT TCCGAGACCG TGCAGGCGCT CTTGTGGTAT
CCGCCGCCGT GGGTGCTGGC GATCCTGCTC ATCGCGGTGT CCTTCTGGCG GGTCGGGTGG
CGCTTCGGCA TCTTCGCCAT CATCGCCCTG GCGCTGATCT TCAGCATGGA GCTGTGGCCG
GAGACCATCC GCACCCTGTC GCTGGTGGTG GCCTCCTCCA TCGCGGCGCT GATCATCGGC
CTGCCCATCG GCATCGCCAT GTCGCGCAAC GACCGGGTGG AGATGGTGGT CCGCCCCATC
CTCGACCTGA TGCAGACCAT GCCGCCGTTC GTCTACCTGA TCCCGGCGGC GATCTTCTTC
GGACTGGGCA CGGTGCCGGG GGCCATCGCC ACGCTGATCT TCGCCATGCC GCCGGCGGTG
CGCCTGACCA ACCTGGGCAT CCGCCAGGTC AGCCAGGAGC ACGTGGAGGC CGGCCAGGCC
TTCGGCTGCA CGCCGCGGCA GCTGCTGTTC AAGATCCAGC TGCCGCTGGC CACGCCATCG
ATCATGGCCG GGATCAACCA GACCATCATG CTCGCCCTGT CCATGGTGGT GATCGCCTCC
ATGATCGGCG CCGGTGGCCT GGGTGGCACG GTGCTCACCG GCATCCAGCG CCTGCAGGTC
GGGCTCGGCT TCGAGGGCGG TCTGGCCGTG GTCTTCCTGG CCATCCTGCT CGACCGCATC
AGCCAGAGCT TCGGCGAGCG TCAGCGCGGC AAGGGCCGCG ACTACGGCGC CCTGCTGCGC
TGGTTCTTCG GTCAAAAGCG CGACCCGGCG CAACAGCCCC AGCAGGGCTG A
 
Protein sequence
MQDQPDLEEH ETIGPLDQAA EWFSENILDN ITIGDWIEDG VDWISDNLEP LLDGIEGAIR 
ALVDSTEFLL LYPLWIAAFF LVVGAWRTWG RKAGLISLAV AVALFGMGLF SETVQALLWY
PPPWVLAILL IAVSFWRVGW RFGIFAIIAL ALIFSMELWP ETIRTLSLVV ASSIAALIIG
LPIGIAMSRN DRVEMVVRPI LDLMQTMPPF VYLIPAAIFF GLGTVPGAIA TLIFAMPPAV
RLTNLGIRQV SQEHVEAGQA FGCTPRQLLF KIQLPLATPS IMAGINQTIM LALSMVVIAS
MIGAGGLGGT VLTGIQRLQV GLGFEGGLAV VFLAILLDRI SQSFGERQRG KGRDYGALLR
WFFGQKRDPA QQPQQG