Gene Hhal_0335 details

Gene Information

Locus tag: Hhal_0335
Symbol:
ID: 4711295
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 385086
End bp: 386690
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 67%
IMG OID: 639854798
Product: BCCT transporter
Protein accession: YP_001001931
Protein GI: 121997144
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1292] Choline-glycine betaine transporter
TIGRFAM ID:
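
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based, inclusive coordinates and that Protein Length excludes the stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 385086
end_bp = 386690
gene_length_bp = 1605
protein_length_aa = 534

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS is read codon by codon; the final codon is the stop codon,
# which is assumed not to be counted in the protein length.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp % 3, expected_protein_aa)  # 1605 0 534
```

Under these assumptions the span equals the listed Gene Length, is a whole number of codons, and gives the listed Protein Length.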


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.568613
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTCAAC CCGATTCGGC GGGTAACCGC CAGGCTGTGC CCGAACGGGA GATTCGCGGC 
AACGAACGCA CAGTGATGGT GCTCTCGGGT GGCTTTGTCC TGGCCTTCAT CGCCATCGCG
GCGGTGGACC TGGACTGGCT GAGCGCAGCG GTGGATACCA CCTTCGGCTA CGCCACGCAG
TACTTCGGCG CGTACTGGCA GCTACTGATG CTCGCCACTT TCCTGATCGC CCTGGTCATG
GCCTTCAGCC CGCTGGGCCG GCTGCGGGTC GGCAACGATA AGCCGGATTA CAGCAACTTC
TCCTGGCTGT CCATGATCAT GGCCACGCTG CTCGCCGGTG GCGGCGTCTT CTGGGCGGCC
GCCGAGCCCA TTGCCCACTT CATGGATCCG GCGCCGCTGT TCGCCGCCGA GCTGGGCATC
GAGGGCGGCG AGGAGGGCGC CGCGGTGCCG GCGCTGGCGC AGAGCTTCAT GCACTGGGGC
TTCCTCGCCT GGTCGATCCT CGGGGCGCTG ACCGGCGGCA TGCTCATGCA GCTGCACTAC
CACCGGGGCT GGCCGCTGAA GCCGCGCACG CTGCTCTATC CGCTGTTCGG CGAGCGCATC
ATGCACGGCA CCCCCGGGGC CATCGTCGAC ACCTTCTGTG TGATCGCTGT GGTGGCCGGC
ACCGTGGGGC CGCTGGGCTT TCTCGGGCTG CAGGTGGCCT ACGGCCTGAG CGATCTCTTC
GGGCTGCCGG ATAGCGTCTG GCTGCCGGCG GCGGTGCTGC TGGGGGTGAT CCCGCTCTAT
CTGTTCTCGG CGATCACCGG GCTGAACCGC GGCATCCGCG TCCTCAGCCG CTTCAACGTG
ATGCTGGCGC TCTTCCTGGC CGCCTTCATC CTGCTCTTTG GCCCGACGGC GTTCATCATC
GACGGCTTCG TCCAGGGCAT GGGCACCTAC GTGGATAACC TGCTGCCGAT GGCGACGTTC
CGCGAGGATC CGGCCTGGCT GGACTGGTGG ACGGTCTTCT TCTGGGGCTG GTTCATGGGC
TACGGCCCGC TGATGGCGAT CTTCGTGGCG CGCATCTCCC GCGGGCGCAC CATCCGCCAG
ATCGTGCTGG CCATGTCGGT GCTCGCCCCG GTGGCCACCT GCTTCTGGTT CGCCATCGTC
GGCGGTTCGG GCATCGCCTT CGAGCTGGCC ACGCCGGGTG CCATCTCCGA GCCCTACAAC
GAGGCCGGGC TGCCGGCGGC GATGATGGCC ATCGCCCAGC AGATGCCCTT CGGTGCGATC
ATCGCCGTGC TCTTCCTGGT GCTGACCACC ATCTTCGTGG CCACCACCTC GGACTCGATG
ACCTACACCA TCTCGCTGAC CATGAGCGAG ACCGACGAGC CGGCCACCTG GCTGCGGGTC
TTCTGGGGCC TGGTCCTGGG CGTGATGGCG ATGATCCTCA TGCTCATGGG CGAGGGCGGT
GTCACGGCCC TGCAGTCGTT TATCGTGGTC ACCGCGGTGC CGGTGTCGCT GATCCTGCTG
CCGTCGCTGT GGTACGCCCC GCGCATGGCC ATGCACCTGG CCGACGGCGA GGAGCCGGCT
ATCGACGGTG AAACGGGGCA GTCGGTGAAG CCGACCGGGC GCTAA
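
The gene sequence is printed in space-separated blocks of ten bases, so the spacing has to be removed before any calculation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and how the database rounds its reported percentage is not known here.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group bases into blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Return the fraction of bases that are G or C (0.0 to 1.0)."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence above, used only as a short demonstration.
first_row = "ATGTCTCAAC CCGATTCGGC GGGTAACCGC CAGGCTGTGC CCGAACGGGA GATTCGCGGC"
print(len(clean_sequence(first_row)))
print(round(gc_fraction(first_row), 3))
```

Passing the full gene sequence (all rows joined) to `gc_fraction` gives the value to compare with the listed GC content.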
 
Protein sequence
MSQPDSAGNR QAVPEREIRG NERTVMVLSG GFVLAFIAIA AVDLDWLSAA VDTTFGYATQ 
YFGAYWQLLM LATFLIALVM AFSPLGRLRV GNDKPDYSNF SWLSMIMATL LAGGGVFWAA
AEPIAHFMDP APLFAAELGI EGGEEGAAVP ALAQSFMHWG FLAWSILGAL TGGMLMQLHY
HRGWPLKPRT LLYPLFGERI MHGTPGAIVD TFCVIAVVAG TVGPLGFLGL QVAYGLSDLF
GLPDSVWLPA AVLLGVIPLY LFSAITGLNR GIRVLSRFNV MLALFLAAFI LLFGPTAFII
DGFVQGMGTY VDNLLPMATF REDPAWLDWW TVFFWGWFMG YGPLMAIFVA RISRGRTIRQ
IVLAMSVLAP VATCFWFAIV GGSGIAFELA TPGAISEPYN EAGLPAAMMA IAQQMPFGAI
IAVLFLVLTT IFVATTSDSM TYTISLTMSE TDEPATWLRV FWGLVLGVMA MILMLMGEGG
VTALQSFIVV TAVPVSLILL PSLWYAPRMA MHLADGEEPA IDGETGQSVK PTGR
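
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal, dependency-free illustration, not the database's own method. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in the permitted start codons; because this gene starts with ATG, no special start handling is included. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the ten-base grouping
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on ambiguous bases
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 15 bases of the gene sequence above.
first_codons = "ATGTCTCAAC CCGATTCGGC GGGTAACCGC"
last_codons = "CCGACCGGGC GCTAA"
print(translate(first_codons))  # MSQPDSAGNR
print(translate(last_codons))   # PTGR
```

Both fragments match the corresponding ends of the protein sequence listed above. Translating the full gene sequence in the same way should give the complete protein.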