Gene Haur_1076 details


Gene Information

Locus tag: Haur_1076
Symbol:
ID: 5732865
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1231139
End bp: 1232665
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 54%
IMG OID: 641278214
Product: PSP1 domain-containing protein
Protein accession: YP_001543852
Protein GI: 159897605
COG category: [S] Function unknown
COG ID: [COG1774] Uncharacterized homolog of PSP1
TIGRFAM ID:
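
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(1231139, 1232665)
print(length, "bp")                      # Gene Length field
print(protein_length_aa(length), "aa")   # Protein Length field
```

With the values from this record, the two results agree with the reported Gene Length (1527 bp) and Protein Length (508 aa).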


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCTTGG TTGTTGGAGT TAAGTTTAAA GATTCGGGCA AAATTTACCA TTTTGACCCA 
AATCAACATC ATTTAGAGCT AGGCGATGCC GTAGTAGTTG AAACGGTACG CGGGCTAGAG
CTTGGCAAAG TGGCAGCACC GATCGAAGAT CTACCCGATT CTGATTTGAT TGCCGAACTC
AAACCAGTGA TTCGCCAGCC CACCACCGAA GATTATGACC GCATGCGAGT GCTAGCTGAG
CGACGCGATG ATGTATTGCG GATTTGTGCC GAACGAATTA GCGTGCATCG CTTGCCGATG
CGCCTGATTC GCAGCGATTG GAATTTTGAT GGCACCCGTT TGACGATCTA TTTCACCTCG
CAGCAGCGGG TTGATTTTCG CCATTTGGTA CGCGAACTAG CGCGAATTTT CCATGCACGG
ATCGAATTAC GGCAAATTGG GGCACGCGAT GAAGCCAAAC TGCTGGGCGG ATTAGGGCCA
TGTGGCCGAC CATTGTGCTG CTCAACCTTC TTGCCCGATT TTGCCCGCGT TTCGGTCAAA
ATGGCCAAAG ATCAAGACTT GCCACTCAAT CCCTCGAAAA TTTCGGGGGT TTGCGGGCGC
TTGCTCTGCT GCCTTTCGTA TGAACAAGAG CAATATCTTG AGATTAAGGC CGAGCTGCCA
ACCCGTGGCG AGCAGGTCAA AACGCCCGAA GGCATTGGCT ATGTAGTTGC TGTCAATACC
ATTCGCGAAA CCGTTACCGT TGATATTGAA AGTAATTACC ACGATTTCAA AGCTGATCAA
CTTGAAACCC TGACAGGCGC AGCTGGAGCG ATTGCCCGCG AACGGCTCGA ACAAGGCGAG
GCCGCACCAA TTGCCCAAAA ACGTTTCAAC CGCCCAGTTG GCGAAACGAT TAAATCAACG
CTGAACAACT GGGATAACGA AGATTGGGGC TTGGATGAAC TACGCTCGTT TGAGGAAGAT
CCCGAAAGTT CAGGCGATAA GCCCAAACCA CGCCCCAAGC CAGCCCAACA AGCTCGGCCA
AACCCCGAGG CACGCGAACA ACGTACAAAT CCGAACGAGC AACGGCCACG CTTTGACCGA
GCAGAACGCC AGAAACGCTT TAATCGCTCG GAGGCGGCCA ACGCTGAGGC TAGCGTAGCC
CCAACACCAA GCAACGAGCC AAGTGAACGG AAACCACGTC ATGCCTTCAA GCGCAAAGGC
GACACGACCC CAACGCCTGA GCGACCAAAA CTACCGGCGA CAGTGCGTCA GCAATCGCTG
GGCGGCGTGC CAGAGCAACT AACTCCGCGC GGTCGGCAGC CCTCGACGAA TGACCCTGAT
GAACCGAAGC AACCAAATCG TCGCCGACGG CGCGGCAATG GTGGTAATGG CAATCAGCAA
GGGCAACAGC AAGTTCAGCC CAAGCCCAAC CAGCCGCAAG TAACATCAAG CAAGCCATCT
ACGGTCGAGA AAACCGAATC AGAATCAGCG AAGCCAGAGG CGAATAACGA GCGTCGTTCG
CCAAGGCGAC GCAAACGCCG CAGCTAA
 
Protein sequence
MPLVVGVKFK DSGKIYHFDP NQHHLELGDA VVVETVRGLE LGKVAAPIED LPDSDLIAEL 
KPVIRQPTTE DYDRMRVLAE RRDDVLRICA ERISVHRLPM RLIRSDWNFD GTRLTIYFTS
QQRVDFRHLV RELARIFHAR IELRQIGARD EAKLLGGLGP CGRPLCCSTF LPDFARVSVK
MAKDQDLPLN PSKISGVCGR LLCCLSYEQE QYLEIKAELP TRGEQVKTPE GIGYVVAVNT
IRETVTVDIE SNYHDFKADQ LETLTGAAGA IARERLEQGE AAPIAQKRFN RPVGETIKST
LNNWDNEDWG LDELRSFEED PESSGDKPKP RPKPAQQARP NPEAREQRTN PNEQRPRFDR
AERQKRFNRS EAANAEASVA PTPSNEPSER KPRHAFKRKG DTTPTPERPK LPATVRQQSL
GGVPEQLTPR GRQPSTNDPD EPKQPNRRRR RGNGGNGNQQ GQQQVQPKPN QPQVTSSKPS
TVEKTESESA KPEANNERRS PRRRKRRS
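
The gene and protein sequences above are related by the genetic code named in the Translation table field (11, the bacterial code). A minimal sketch of how the listed fields could be recomputed from the gene sequence follows. The helper names are hypothetical. The codon table is the standard amino acid assignment, which table 11 shares with table 1. The two tables differ only in which codons may act as starts, and this sketch does not model alternative start codons (this gene starts with ATG, so that is not an issue here).

```python
from itertools import product

# Standard codon assignment in NCBI order (bases T, C, A, G; third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence above.
print(translate("ATGCCCTTGG TTGTTGGAGT TAAGTTTAAA"))  # MPLVVGVKFK
```

Passing the full gene sequence to `translate` and `gc_percent` should let a reader compare the results against the Protein sequence and GC content fields of this record.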