Gene Haur_3058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3058 
Symbol 
ID5734930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3863914 
End bp3865581 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content52% 
IMG OID641280202 
Producthypothetical protein 
Protein accessionYP_001545824 
Protein GI159899577 
COG category[S] Function unknown 
COG ID[COG5267] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.417569 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCTGT CGCGTCGCCA GTTATTGTTG GGCGCTGCGC TAGGGGCCAC AGGCGCAGTC 
GTTCATGAGG ATGTTGGGGC AGCGCCGCTG CAACCCTCGA CAACTGAAGC CATGGCAACT
CCGCCATTCG AGGTCATCGC GCTCTCGCGC ATGGCCTACG GAGCACGTTC GGGCGATTTC
GCCCGAGTTC GTAGTATGGG TTTAACTGCC TATGTCGATG AGCAGCTTAA CCCTAATTTC
AACAACGACA CCGATTGTAA CACCCGTATC GCCAACGCCA CCTTGCGCAT TGTCTACGCC
GCTGGCACGG GCTTTCCGGC CATGGATGAA ATGCGTGGCC TTGTTACCCT CAACAAAACC
CAGCCCGAAT TATGGGAATT GCGCGTACAT CCAGCCAACG CTGAGCGCAT TCGCCCAATC
GATGAAGTGG TAGCCGCCAA TTGGATTCGC GCAATCTACA GTAAATGGCA ACTATTCGAG
ATTATGACCG ATTTTTGGCA TAATCACTTC AATGTCTGGG CCTATAGCGA TACGCGCATC
TCTTCGCTTT GGCCCTATTA CGACAAAAGC GTGATTCGCG CCAATTGTTT TGGTAACTTC
CGCAGCTTTC TCGAAGCGGT GGCCACCAGC CCAGCCATGC TGTATTACCT TGATAATGCG
ACCAGCCGCG ATGGCCCCGC CAACGAAAAT TATGCCCGTG AACTATTTGA ATTGCACACC
TTTGGTTCGC AAAATTACCT CAATAATATC TACGACAACT GGCAAGAAGT ACCGCGCGAT
TCGCAAGGCC GCCCAATCGG CTATATCGAC CAAGATGTGT ATGAGGCGGC CCGCGCCTTT
ACAGGCTGGA CGGTGGCCGA TGGCACAGGC GGCATTCCCA ACACGGGTTT ATTCCATTAT
CTCGATACAT GGAACGATAA TGCCCAAAAG ATTGTGCTGG CCAACTTCTT AAATGCCAAT
GCTGGCCCCC AAGCTCATGG CAAAAAGGTT TTGGATCTTG TAGCCCAACA TCCGGCCACA
ATTCGCAATC TTTGTACCAA ACTGTGTCGC CGTTTGGTCA GCGACAATCC ACCAAGCACG
CTGGTAGATA AAGCAGTCGC CACATGGACA GCCAACTATT CAGCCCCTGA TCAGATTAAG
AAAACCATTC GCACAATTTT GCTAGCTCCC GAATTTTTGA GCACATGGGG TGGCAAGATT
CGCCGCCCGA ATGAAGTTGT TGCCGCCTAT TTGCGCTCAA CTGGAGCCGA AGTTAAACCG
AGCGCCGAGC TATTTAGCTG GGTTACCTTG GCGGGGTATC GCATGTTCAA TTGGGCTACA
CCCACCGGCC ACCCCGACGA AAGCGGCTAT TGGAGCAGCA GCAACGCCCT GCTCAATACC
TGGAACTTGT TATTCCACTT GCAGCAAAGC TACTTCCCGC CTGCAACCTT CGATTTGCAA
GGCCAAATGC CGGGCAGCGT CACCACCGTG CGCCAAATCG TCGATTTCTG GATTATGCGC
ATGCTTGGCT ATCAACCTTC AGCCTTGGTC AAAACCAAAT TGCTCAAACT GATGGGCCAA
AATGGCAATC TCGATCAACC ACCAACTGGA ACCGCCAATG ACGTTAAATT ACGCTTGAGC
AGCCTTGTGC ATATGATCGG CATGCTCCCT GAATTTTATA CGCGCTAG
 
Protein sequence
MSLSRRQLLL GAALGATGAV VHEDVGAAPL QPSTTEAMAT PPFEVIALSR MAYGARSGDF 
ARVRSMGLTA YVDEQLNPNF NNDTDCNTRI ANATLRIVYA AGTGFPAMDE MRGLVTLNKT
QPELWELRVH PANAERIRPI DEVVAANWIR AIYSKWQLFE IMTDFWHNHF NVWAYSDTRI
SSLWPYYDKS VIRANCFGNF RSFLEAVATS PAMLYYLDNA TSRDGPANEN YARELFELHT
FGSQNYLNNI YDNWQEVPRD SQGRPIGYID QDVYEAARAF TGWTVADGTG GIPNTGLFHY
LDTWNDNAQK IVLANFLNAN AGPQAHGKKV LDLVAQHPAT IRNLCTKLCR RLVSDNPPST
LVDKAVATWT ANYSAPDQIK KTIRTILLAP EFLSTWGGKI RRPNEVVAAY LRSTGAEVKP
SAELFSWVTL AGYRMFNWAT PTGHPDESGY WSSSNALLNT WNLLFHLQQS YFPPATFDLQ
GQMPGSVTTV RQIVDFWIMR MLGYQPSALV KTKLLKLMGQ NGNLDQPPTG TANDVKLRLS
SLVHMIGMLP EFYTR