Gene Haur_1275 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1275 
Symbol 
ID5733168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1484680 
End bp1485990 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content52% 
IMG OID641278415 
Producthypothetical protein 
Protein accessionYP_001544051 
Protein GI159897804 
COG category[S] Function unknown 
COG ID[COG4842] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0739014 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGCAC CAATCGTTCA AGCTGATTTT GAAGTAATGG ATCAAGTTGC CCAGCGCCTT 
AGCAAAAATG CTGAGAGTGT TACGGCAATG CAAAATACGC TCAAACAAAC TATTGAAGAC
TTACGCTCAA CATGGTTGGG TGATGCTGCG GTTGCATTTC AAAAAGAAAT GCAGGCTGAT
ATTTTGCCTG CAGTGCAACG GCTGATCAAC GCTTTCCAAA CCGCCCAAAG TACAACCTTA
GAAATTAAGA AAGTTTTACA AGAAGCCGAG CAAGAGGCCG CCAATCTATT TAAAGGCGAT
CCAACTGGTG GCTCAGCCAG CACCCAAAGC GCTAGTTCAA GCGCTGGTGG CGGTGGAGCC
TCAAGTGCTG GTGGCGATAC TGCGGCAGCC AGTGCTAGCC CAAGTAATGT TGGCGTAATG
GCTGGTGGCA CCAGCAGCGC TTCTGCCAGT GGCAGTGGCG GCGGCGGTGG TGGCGGTGGC
GGTGCAGCCT CGGCTCAAGC AAGTGGCGGC GGCGGTGGTG GTGGTGGCGG AGCAGCCTCA
GCTCAACCAA CTGGCCAACA ACCCAAGGCT ACCAGTGGTG GCGGCGGTGG CGGTGGTGGT
GGTGGCGGAA CAGCCTCAGC CCAACCAACT GGTGGTAATG CAGCCGCTGC AGGTAATGCA
AGCCTTGGTA AACTCTCTGA AAAATACGAA ACTGGTGGCC GTGGCCCAGG CACGGTTTCA
TCAGGCAAAG GCGACCTTGG CGGCGCTTCA TATGGCTCAT ACCAAATGAC CAGCCAAACT
GCCATCAAAA AAGATGGCAA AATTGTCTTT GTTAATGGCG GACGGGTCGC TGAATTTTTA
CGCAACCCTG CTGGTGCACA ATATGCTGAA GAATTTAAGG GCTTGAAACC AGGGAGCGCT
GAATTTACCG CCAAGTGGAA GCAAATTGCT GCTCGCGATC CACAAGGCTT TGCTGCTGCC
CAACATCAGT ATATTGAAAA CACCCACTAT CAGCCTCAAG TCAACAAGCT CAAGGCAGCT
GGCTTTGATG TAAACAACTA TTCGCCAGCA ATGCGCGATG TTGTTTGGTC AACCTCAGTT
CAACATGGCC CAGGCGCAAG CGTGATCACC AATGCGCTCC GTGGCAAAGA TCTTAGCCAA
ATGAGCGAAT CGCAAATTAT CAATGCGATT TACACCGAAC GTAGCAAAAC CCTCGATAAT
GGGCGCTTGG CCTATTTCAA AAATACCAGC GATGCTGGGG TTATTCAAGG CTTGAAAAAC
CGCTTCGTCA ACGAACGCAA AGATGCCTTG AACATGTCGG CAAATCACTA G
 
Protein sequence
MAAPIVQADF EVMDQVAQRL SKNAESVTAM QNTLKQTIED LRSTWLGDAA VAFQKEMQAD 
ILPAVQRLIN AFQTAQSTTL EIKKVLQEAE QEAANLFKGD PTGGSASTQS ASSSAGGGGA
SSAGGDTAAA SASPSNVGVM AGGTSSASAS GSGGGGGGGG GAASAQASGG GGGGGGGAAS
AQPTGQQPKA TSGGGGGGGG GGGTASAQPT GGNAAAAGNA SLGKLSEKYE TGGRGPGTVS
SGKGDLGGAS YGSYQMTSQT AIKKDGKIVF VNGGRVAEFL RNPAGAQYAE EFKGLKPGSA
EFTAKWKQIA ARDPQGFAAA QHQYIENTHY QPQVNKLKAA GFDVNNYSPA MRDVVWSTSV
QHGPGASVIT NALRGKDLSQ MSESQIINAI YTERSKTLDN GRLAYFKNTS DAGVIQGLKN
RFVNERKDAL NMSANH