Gene Hore_12850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_12850 
Symbol 
ID7313606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp1379789 
End bp1380850 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content38% 
IMG OID643611725 
ProductStage II sporulation P family protein 
Protein accessionYP_002509030 
Protein GI220932122 
COG category 
COG ID 
TIGRFAM ID[TIGR02867] stage II sporulation protein P 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00000279159 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTATTA ATAAAGCCAA ATATGTGGTA ATTATAATTA TTCTGGTCCT GGCATGGTTA 
AATATCATGA AATACCGGAC TACTGGTAAT GAAGCAATAC CGGTCTGGTC GAGTTTTGAT
AAAAAATACT ATAGACAGGT TAATATGGGA ACCCTGGATA AAATTAAAAA AACCCTGCAA
CCCGATAATT TAATATATCA GCTAACCGGG ATCCGCATTA AGGCCCCTAT TACCTACCTG
AAACGGGAAA TACCTTTATT AAGTTATTAC ACCCCTGCCA CGTTGAGACA ACCTTCCAGG
AAGGTTTATG AACCTGCAGA AGATAAAAGT AAAAAAAGTA GTGTTATCAG GTTAAAATTT
GATTTGCGTG AAAGTAAACA AAGGGAAGCA GATAATATTA AAAAAGCGGT AGAACGTCCT
CTGGTTGTGA TATACCATAC CCATACTTCA GAAACCTATA TAGATGATCC CCGACCCCAG
GATAACAACG GACATGTTCT GCCGGGTCAA ATCGGGAATA TAGGGAGGGT TGGAGCCGAG
CTTGCCCGGA TCCTTTCAGA ACAACATAAT ATAAGGGTTA TACATACAAC CAGGGTACAT
GATGAAAGTT ATGCCCGGGC CTATTATAAA TCACGACAAA CCCTTAAAAA TATTTTAAAA
AAGTACGAAG GAGTTGACCT GGTACTTGAT ATTCACCGGG ATGGAGTTGA AGATATTAAA
GAAGGGGTAT ATACTACCAC CCTTAATGGA AAAAAAGTTG CCAGAATAAT GATAGTGGTG
ACAAACGGTA AATTTGATTT TGCCAGATTG AATCTTAAAG AGCATCATCA GAACTGGAAG
AAAAACCTTG AGTTTGCTCA AAAAATGTCA GGCAAAATTG AGGAAATGTA TCCTGGGCTC
CTCAAAAGAC TGGAGATTAG AGATACCACC TATAATCAGG ACCTTCATCC CAGGGCTCTA
TTACTGGAAA TAGGTGATTA CAATAATACA ACCACAGAGG CCATAAATTC GGTAAGGTTA
CTGGCTGATG TAATTTCTTC TTTACTGTAT AAAAGGGATT GA
 
Protein sequence
MSINKAKYVV IIIILVLAWL NIMKYRTTGN EAIPVWSSFD KKYYRQVNMG TLDKIKKTLQ 
PDNLIYQLTG IRIKAPITYL KREIPLLSYY TPATLRQPSR KVYEPAEDKS KKSSVIRLKF
DLRESKQREA DNIKKAVERP LVVIYHTHTS ETYIDDPRPQ DNNGHVLPGQ IGNIGRVGAE
LARILSEQHN IRVIHTTRVH DESYARAYYK SRQTLKNILK KYEGVDLVLD IHRDGVEDIK
EGVYTTTLNG KKVARIMIVV TNGKFDFARL NLKEHHQNWK KNLEFAQKMS GKIEEMYPGL
LKRLEIRDTT YNQDLHPRAL LLEIGDYNNT TTEAINSVRL LADVISSLLY KRD