Gene Haur_3344 details


Gene Information

Locus tag: Haur_3344
Symbol:
ID: 5735214
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4215732
End bp: 4217276
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 52%
IMG OID: 641280491
Product: PT repeat-containing protein
Protein accession: YP_001546108
Protein GI: 159899861
COG category: [R] General function prediction only
COG ID: [COG5401] Spore germination protein
TIGRFAM ID:
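
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4215732, 4217276)
print(length, protein_length_aa(length))  # 1545 514
```

Both results agree with the Gene Length and Protein Length fields of the record.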


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGGTG AGGCTCACGA CATCCCAGTG CCCCAAGCCT CGTTTTGTCT AAGTAGCAAC 
CATGTGCGTT GCCCATTATA TGCAGGTGAA GATCTGCCGG TTGCGCAGGT TATCAGCACG
CCTACGCCAG TTGCGGTGGG TGGTTGGCGC GGCTGGCTGG CTGGTTTATC GACCCGCGAT
CGCCGCATTT ATGCCACCTT GGTGGGCCTA CTTGGCTTAA TTATTGTGGC CTATGCGATT
AGCGGCGTTG TTTTATTTAG CAACCCCGAT AACCCTGCCA CGCCTAGCGC TACCTCGCAA
GTGCTTCAGC CAACATCCGA TAGCCCAACA TTAACGGTTT CACCATCGCC AAATGCCTTT
GCCACAGCAG CGGTTCGTCA AACTCAAACA GCCGAAGTTA TTGCTCAAAC CACTACCGTC
ACTCCCTCGG TCTCTGCTAC GTCATCGGCT TCTGCAACCA CTCAGGTGAT TCTTGCATCG
CCAACCTTTG TTATTGTACC GCCAACTGAA GATGTGATTG TCGCCTCTGC TACTCCTAGC
ATTCCGTTTG CCACCGATCT CCCAACCTTT GAGCCAACGT TATCGCCAAT TGCGACAACT
GCCGTGCCAA CGCTTGAACC AACCGCAGAA CCGACTGTTG AACCAACTCT TGAGCCAACG
CTCGAACCAA CAGTTGAGCC GACTGTCGAG CCAACGCCTG AACCGATCCC TGAGCCAACT
GCTCAGCCGA CCGAGGAAAC TGGCGGTCGC GAGGTTAATC AATTAACCTT GTTTTTTGCC
GATAGCACTG GCCAAGTGTT AGTGCCAGTC TCGCGCCAGA TTGCGGCAAC TCGTCAGTCA
CGGACTGCCG CAATCCAACA GTTAATTCAA GGTGCACGCA GCGATTTGCG TAGTTTGTTG
CCCAGCGATA CCCAATTACT TGGGCTACGC TTGAATAATG GCATTGCTAC CGCTAATTTT
AACCGTATCC CGACGTTTGG CAATTCAAGC CTCGAAGATT TGGGTTTGCG TTCGATTGTG
TTGGCCTTGA CTGAGCAACC AGAGGTTAAG CAGGTGCAAA TTCAAGTCCA AGGCCAAAAT
TTAGGTGGCC TGCGCTATCG TCCCAATGTC AACCCCGATA ATCCGCAGGG TTTAAATGGT
CAGTTTAACA CAACTTCGTT CTTGCCGTTA TATTTTCAGC AAAGTAGTGG CCGTTGGGTG
CGGGTGATGC GGCTTGTGCC AAGCACCAAA ACCGAGGCCC GCGCTACCGT CAATGAGCTG
ATTCGCGGAG CTGGCCGTTA TAGTCATGTT GTTAGTAGTG CCATCCCGAG CGCCAGCCAA
GTACGGCGTT TGGTGATTGT TGATGGGGTT GCTCAACTTG ATCTTAGCGC TGAATTCAGC
CAAACCAGCA ATCCGCAGGC GGCGGTTGAT GCCTTGGTCT TGGCGTTAAC TTCGTTCAGT
AGTGTGCAAC AGGTACAGAT TACCGTCGAA GGCCAATCGC TCAGCAGCAT TTGGGGCGCA
ACATTCAGCA ATCCTTTCGT TCGCCCACAA CTTAACCCTG AATAG
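
The gene sequence is printed in blocks of ten bases, so it has to be joined before any statistic such as the GC content field can be recomputed. The following is a minimal sketch of that step; `clean_sequence` and `gc_percent` are hypothetical helper names, and the example runs on the first line of the sequence only.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in space-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above; paste the full sequence
# to recompute the value for the whole gene.
first_line = "ATGACGGGTG AGGCTCACGA CATCCCAGTG CCCCAAGCCT CGTTTTGTCT AAGTAGCAAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```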
 
Protein sequence
MTGEAHDIPV PQASFCLSSN HVRCPLYAGE DLPVAQVIST PTPVAVGGWR GWLAGLSTRD 
RRIYATLVGL LGLIIVAYAI SGVVLFSNPD NPATPSATSQ VLQPTSDSPT LTVSPSPNAF
ATAAVRQTQT AEVIAQTTTV TPSVSATSSA SATTQVILAS PTFVIVPPTE DVIVASATPS
IPFATDLPTF EPTLSPIATT AVPTLEPTAE PTVEPTLEPT LEPTVEPTVE PTPEPIPEPT
AQPTEETGGR EVNQLTLFFA DSTGQVLVPV SRQIAATRQS RTAAIQQLIQ GARSDLRSLL
PSDTQLLGLR LNNGIATANF NRIPTFGNSS LEDLGLRSIV LALTEQPEVK QVQIQVQGQN
LGGLRYRPNV NPDNPQGLNG QFNTTSFLPL YFQQSSGRWV RVMRLVPSTK TEARATVNEL
IRGAGRYSHV VSSAIPSASQ VRRLVIVDGV AQLDLSAEFS QTSNPQAAVD ALVLALTSFS
SVQQVQITVE GQSLSSIWGA TFSNPFVRPQ LNPE
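
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do that with the standard library only. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the permitted start codons; since this gene starts with ATG, no start-codon special case is handled. `translate_cds` is a hypothetical helper name.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(blocked_dna: str) -> str:
    """Translate a CDS (whitespace allowed) up to the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("ATGACGGGTG AGGCTCACGA CATCCCAGTG"))  # MTGEAHDIPV
```

The output matches the first ten residues of the protein sequence, and the final bases `CTTAACCCTG AATAG` translate to `LNPE` followed by the stop codon.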