Gene Hhal_2152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2152 
Symbol 
ID4709707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2360460 
End bp2362130 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content67% 
IMG OID639856627 
Productputative ABC transporter ATP-binding protein 
Protein accessionYP_001003718 
Protein GI121998931 
COG category[R] General function prediction only 
COG ID[COG0488] ATPase components of ABC transporters with duplicated ATPase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0875259 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCAGT ACGTCTTTAC CATGAACCGG GTATCCAAGG TGGTACCCCC CAAGCGACAG 
ATCCTCAAAG ACATCTCGCT GTCCTTTTTC CCGGGCGCCA AGATCGGCGT CCTCGGCCTC
AACGGGGCGG GCAAGTCCAG CCTGCTGCGC ATCATGGCCG GCGAGGACCA GGAGATCGAC
GGCGAGGCGC GGCCGCAGCC GGGCATCAAG ATCGGTTACC TGCCCCAGGA GCCGCGGCTC
GACGACGACA AGGACGTGCG CGGCAACGTC GAGGAAGGCG TGGCCGAGAC CAAGGCGCTG
GTGGACCGCT TCAATGAGGT CTCGGCGCAG TTCGCCGACC CGGAGGCCGA CTTCGACGCC
CTGATCGCCG AGCAATCCAA GCTTCAGGAT CAGATCGAGG CGGCCGGCGC CTGGGATCTG
GACCGCAAGC TGGAGCAGGC CGCCGACGCC CTGCGCCTGC CCCCGTGGGA CGCCGCCGTA
GCCAACCTCT CCGGTGGCGA GCGGCGCCGC GTGGCGCTGT GCAAGCTCCT GCTCTCGGCG
CCGGACATGC TCCTGCTCGA TGAGCCCACC AACCACCTCG ACGCCGAGAG CGTGGCGTGG
CTGGAGCGTT TCCTCCACGA GTTCCCAGGC ACCGTGGTGG CCGTGACCCA CGACCGCTAC
TTCCTGGACA ACGTGGCCGG GTGGATCCTC GAGCTCGACC GCGGCGAGGG CATCCCCTTC
CAGGGCAACT ACTCTGCGTG GCTGGAGTAC AAGGAGAAGC GACTGCAGCA GGAGGCCCGC
GAGGAGCAGG CCCAGCGCAA GGCGATCCAG CAGGAGCGTG AGTGGGTCCA GCAGAACCCC
AAGGGCCGGC AGGCCAAGAA CAAGGCCCGC ATCAAGCAGT TCGAGGAGCT GCAGTCCCAG
GAGTTCCAGA AGCGCAACGA GACGCAGCAG CTCTACATCC CGCCGGGGCC GCGCCTGGGC
GACAAGGTAG TGATTGCCGA CGGAGTGCAA AAAGCCTTTC GCGGCGAGCT GCTCTATGAG
GACCTCAGCT TCATCATCCC GCCCGGCGCC ATCGTCGGCC TGGTCGGTCC CAACGGCGCC
GGTAAGACCA CGCTGTTCCG CATGATCACC GGCGAGGAGC AGCCCGACGC CGGCACCATC
GACGTCGGCG AGACTGTGGA ACTGGCCTAC GTGGACCAGA GCCGCGATGC CCTCAACCCG
GAGAACACCG TCTGGCAGGA GATCTCCGGC GGCCACGACT ACATCCGGGT GGGCAACTAC
GAGACCCCGT CCCGGGCCTA CGTGGGCAAA TTCAACTTCC GGGGCTCCGA GCAGCAGAAG
CGCGTCGGCG ACCTCTCCGG CGGCGAGCGC AACCGCGTTC ACCTGGCCAA GCTCCTGCAG
AGCGGCGGCA ACACCCTGCT GCTCGACGAG CCGACCAACG ACCTGGACGT GGAAACCCTG
CGTGCCCTCG AAGACGCCCT GCTCACCTTC CCCGGGTGTG CCCTGGTCAT CTCCCACGAT
CGCTGGTTCC TGGACCGGAT CGCCACCCAC ATCCTGGCCT TCGAGGGCGA GAGCCGAACT
GCCTTTATCG AGGGTAACTA CCAGGATTAC GAGGCCGACC GGAAGAAGCG CCTCGGCGAC
GAGGCGGCCC AGCCCCACCG CATCAAGTAC AAGCGGCTCG GCACGGGCTG A
 
Protein sequence
MAQYVFTMNR VSKVVPPKRQ ILKDISLSFF PGAKIGVLGL NGAGKSSLLR IMAGEDQEID 
GEARPQPGIK IGYLPQEPRL DDDKDVRGNV EEGVAETKAL VDRFNEVSAQ FADPEADFDA
LIAEQSKLQD QIEAAGAWDL DRKLEQAADA LRLPPWDAAV ANLSGGERRR VALCKLLLSA
PDMLLLDEPT NHLDAESVAW LERFLHEFPG TVVAVTHDRY FLDNVAGWIL ELDRGEGIPF
QGNYSAWLEY KEKRLQQEAR EEQAQRKAIQ QEREWVQQNP KGRQAKNKAR IKQFEELQSQ
EFQKRNETQQ LYIPPGPRLG DKVVIADGVQ KAFRGELLYE DLSFIIPPGA IVGLVGPNGA
GKTTLFRMIT GEEQPDAGTI DVGETVELAY VDQSRDALNP ENTVWQEISG GHDYIRVGNY
ETPSRAYVGK FNFRGSEQQK RVGDLSGGER NRVHLAKLLQ SGGNTLLLDE PTNDLDVETL
RALEDALLTF PGCALVISHD RWFLDRIATH ILAFEGESRT AFIEGNYQDY EADRKKRLGD
EAAQPHRIKY KRLGTG