Gene Hhal_1249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1249 
Symbol 
ID4710485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1355204 
End bp1357336 
Gene Length2133 bp 
Protein Length710 aa 
Translation table11 
GC content70% 
IMG OID639855722 
ProductComEC/Rec2-related protein 
Protein accessionYP_001002826 
Protein GI121998039 
COG category[R] General function prediction only 
COG ID[COG0658] Predicted membrane metal-binding protein 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.435974 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAGCAG GATCGCTTGC CTTCGCGGCC GGAATCGTAC TGGTCGTGAG CGTGTTACGG 
GAGCCCCCCG GGGCCTTCTC GGTGGCAGCA GTGGCCGCCT CCGGGGTCGT GTGGGGTCTG
CTCGCCCCGC GCGGCTGGCG CTGGCCGGCT GCCGGGCTCC TAGGCTGCGC CTGGGCGCTG
CTCAGTGCCT GCTGGCTTTT GGCCCATGAA CTTGCCCCTA GCCATGACGG CACCGACGCC
TGCCTGGAAG GGCAGGTCGT TTCCGTCCCT GAGATTGAGA GTCACCGTGC TCGCTTCGAG
TTCCGGCCTG ACGGCGTGCT CGATGGCGAG CTGCTGGAGC CATTGCCGCG ACGCATCCGC
GTGGATTGGT ATGGGGCGAC GGAGGAGCCA GCGCCGGGGG AGCGCTGGCA GCTGTGCCTT
CGCCTGCGCG CCCCCGATGG ATTCCTCAAT CCGGAGGGCT TCGATTACCA GCGGTGGCTG
TTCCAACGCC GGATCGGCGC GACGGCCTAT GTGCGGGAAG CGGAGCAGGC CGAACGCCTG
TCCGGCGGCT GGCGAATCGA TAGCGTGCGG ACTCGGATCG CCGACGCCAT GGCCGAGCGG
CTCGGCGATT CGCGGTACCT GGGCTTGGTC CAAGGGCTGG GCGTGGCCGT GCGCGATAGG
ATCGGCGATG ACCAGTGGGA GGTGCTTTCG GTCACCGGGA CGGCCCACCT GCTAGCGATC
TCGGGGCTGC ATATCGGTCT GGTTGCCGGC CTCGCCGGCA CGCTGGCCGG CGGACTCTGG
CGTGTGGTTC CGTCGCTGGT TCGCCGGGTT CCCGCCCTGA TTGCCGGCAC CCTGGCCGGG
GCGCTGGCTG CAGCCGGTTA TGCCGCCCTG GCCGGCTTCA CCCTACCCAC TCAGCGCGCG
CTGCTCATGG TGCTGGTCTT CGCCGGCGCG CTCCTGCTGC GCCGCCCGTT AAGCCCCTGG
CACAGCCTCG CCGTGGCGGC TGCCGCGGTG CTCGCCCTCG ACCCCTGGGC GCCGCTGGGG
GCGGGGTTCT GGCTCTCCTT CGGGGCGGTG GCGATCATCC TGGCAGCTAC AACGGGTCGC
CCGATGCAGC GCGGTCCCCT ACCATGGCTG CGGATCCAGG TGCTGGTGGG GGTCGGCATG
CTACCAGCGA CTGCGGTCTG GTTCGGGCAC CTGCCGCTTC TCTCGCCGCT GGCCAATCTC
GTAGCCGTGC CCTGGGTCAG CGTATCGGTT GTCCCCCTGA TCCTGACCGG AGTCGCGCTC
CAGCCCGTGG TGCCAGAGTG GTCCGGCGGG TTGTGGCAGG GTGCGGATGC CGCCCTCGGG
GTGCTGATGC GTGTGCTCAG CGCGCTGGCC GAGTTCCTCG GTGCCGTTGA GGTGCCCACC
CCATCGGTCG GAGCGGTGGC GCTGGCCGCT GTCGGTGTGG CTCTGGTGCT GGCTCCTCGG
GCCGTGCCCG GGCGGTGGCT GGCTGGGCCG TTATTGGCGT TGTTGCTCCT GGGGCCCGGC
AGTAGCGGAG GCGGTCCCTC CGGCCGCGTG GTGGTCTTCG ATACCGGCGG AGGTCTGACC
GCTCTGGCCT GCGATGGCCG TCAAGCGGTG GTCTACGGCG GGGGTCCCGG AGGGGGGCTC
GACGCGGCGA GCGTTGCTGT CGAGCCGTTT CTCGAAGCCC GGGGGCTGAC CCTCAAGGCC
TGGCTGGTGC CGCGCGAGCA GGCCCCCTGG GATGGCGCCG TCGCCGAGGC GCGGCAGCGT
TGGGACGATG CCGAGTGGCG GGGGGCTGGA CAGGATACGA CGGCCGAGGC GACGGAGTGG
TCCCTGGGGC GGCTGACGCT GCAGGAGACG CCCCTCGGCG ACGAAGAGTG GGGCCTGGAT
GTCGAGGGAG AGGCGGGTAC CGTATCGCTG CGACCGGACC TGGAGAACGA CACGAAGGGG
TTGGTTGTCG GCGCCGCCGG ATGTGACGAG GACCGCCAGG CGTGCAGTGC GGCGGTTGAG
TTGAAGGACG GTCGTTTGTT GCGTTCGGAC GAGCAGGGCG CGCTGACTTT GATCGAAGAC
GGCGATGCGT GGCGGGTAGA CAGTGAGCTG CAGCGCAGGG GGCGGGTTTA CCATCAGGCC
AGTCCGGCTG CGTTGGCGCC CGGGATGCTT TAA
 
Protein sequence
MRAGSLAFAA GIVLVVSVLR EPPGAFSVAA VAASGVVWGL LAPRGWRWPA AGLLGCAWAL 
LSACWLLAHE LAPSHDGTDA CLEGQVVSVP EIESHRARFE FRPDGVLDGE LLEPLPRRIR
VDWYGATEEP APGERWQLCL RLRAPDGFLN PEGFDYQRWL FQRRIGATAY VREAEQAERL
SGGWRIDSVR TRIADAMAER LGDSRYLGLV QGLGVAVRDR IGDDQWEVLS VTGTAHLLAI
SGLHIGLVAG LAGTLAGGLW RVVPSLVRRV PALIAGTLAG ALAAAGYAAL AGFTLPTQRA
LLMVLVFAGA LLLRRPLSPW HSLAVAAAAV LALDPWAPLG AGFWLSFGAV AIILAATTGR
PMQRGPLPWL RIQVLVGVGM LPATAVWFGH LPLLSPLANL VAVPWVSVSV VPLILTGVAL
QPVVPEWSGG LWQGADAALG VLMRVLSALA EFLGAVEVPT PSVGAVALAA VGVALVLAPR
AVPGRWLAGP LLALLLLGPG SSGGGPSGRV VVFDTGGGLT ALACDGRQAV VYGGGPGGGL
DAASVAVEPF LEARGLTLKA WLVPREQAPW DGAVAEARQR WDDAEWRGAG QDTTAEATEW
SLGRLTLQET PLGDEEWGLD VEGEAGTVSL RPDLENDTKG LVVGAAGCDE DRQACSAAVE
LKDGRLLRSD EQGALTLIED GDAWRVDSEL QRRGRVYHQA SPAALAPGML