Gene Haur_0996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0996 
Symbol 
ID5732899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1139486 
End bp1140466 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content50% 
IMG OID641278130 
Producthypothetical protein 
Protein accessionYP_001543772 
Protein GI159897525 
COG category[S] Function unknown 
COG ID[COG4301] Uncharacterized conserved protein 
TIGRFAM ID[TIGR03438] probable methyltransferase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAAA CTCAACAAGT AGCCAAAGTA CGCTTGCTTG ATGCCGCCCC CACCATAGCC 
TGCTTCCGCA GCGAAGTGCT TGAGGGCTTG CGTCAACCAA TCAAAACCTT GCCCTGTAAA
TTTTTCTACG ATGCTGAAGG CTCGCAGATT TTTGATGCCA TTTGTGAATT AGCCGAATAT
TACCCAACCC GTACTGAGCT GGCAATTTTG CAGCAAGCCA TGCCTGCAAT TCGCCACTGG
GTTGGGCCTG ATGTCCGGTT AGTTGAATAT GGCAGCGGAG CCAGCCGCAA AACCCGTTTG
CTGCTCGATC AACTTGAAGC ACCAGCGGCC TATCTGCCAA TTGATATTTC ACGCGAACAT
TTGCTCGCCG CCAGCCAAGA TTTAGCCGAG CGCTATCCAG CAATCGAAAT TTTGCCAATA
TGTGCTGATT ACACTCAGCC CTTGAGTTTG CCCCAGGCTC AACGCTCAGT TGGTCGCACA
GTGGTGTTTT ACCCAGGCTC GACAATTGGC AATTTTCACC CTGACGAAGC GTTGAGCTTT
TTAACAATGA TGCGGGAGCT GTGCTTGCCT GATGGTGGCG TGCTGCTTGG CGTTGATCTC
AAAAAAGATC CGAGCCTGTT GCATGCAGCC TACAACGATA CAGCCGGAGT GACTGCAGCA
TTTAATCTCA ATCTGCTGGC GCGAATTAAT CGTGAGCTTG ATGCCAATTT CGCGCTAGCC
AACTTTCGCC ATTATGCCTG CTACAACCCA ATTCAAGGCC GAATCGAAAT GCACTTAGTC
AGTTTGCATG ATCAAACGGT GTGGATTGGC GATCAGCAAA TTGACTTTCG CCGTGGCGAG
CCAATTTGGA CTGAATGCTC CTACAAATAT CATCTGCAAG AATTTGCCGC TTTGGCAGCC
CAAGCCCAAC TTGCGGTAGC CGAGGTTTGG ACAGACCCCC AAAACCTATT CAGCGTGCAC
TATTTACGCC CGATTGCTTA A
 
Protein sequence
MSETQQVAKV RLLDAAPTIA CFRSEVLEGL RQPIKTLPCK FFYDAEGSQI FDAICELAEY 
YPTRTELAIL QQAMPAIRHW VGPDVRLVEY GSGASRKTRL LLDQLEAPAA YLPIDISREH
LLAASQDLAE RYPAIEILPI CADYTQPLSL PQAQRSVGRT VVFYPGSTIG NFHPDEALSF
LTMMRELCLP DGGVLLGVDL KKDPSLLHAA YNDTAGVTAA FNLNLLARIN RELDANFALA
NFRHYACYNP IQGRIEMHLV SLHDQTVWIG DQQIDFRRGE PIWTECSYKY HLQEFAALAA
QAQLAVAEVW TDPQNLFSVH YLRPIA