Gene Haur_0868 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0868 
Symbol 
ID5732769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp986525 
End bp988441 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content52% 
IMG OID641278000 
ProductASPIC/UnbV domain-containing protein 
Protein accessionYP_001543644 
Protein GI159897397 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAGA CCTTGTTGAA ATACCAAACC CAGTTGCTGG CGGTCGCGGT GCTAGTTGGC 
ACGTTTATCG TGGCCCAACC ACCAACGCTT TCACAGGCTG AGCAAGCCGA TTTGAGCCAA
AGTTTTGGCT TTGCGCAGCA GCCCCTTGCC GCATTGGCGG GCCATACCCA GCGCTTCATC
CGCCCAGTTA ATCCAAGTTT GGCCCATATC GATGCTTGGA TCTCGTCGGT TGGGGCGGCG
ATTGCCCTCA ACGATCTCGA TAACGATGGC TTATCCAACG ATATTTGTTA CGTCGATACC
CGAATTGATC AAGTGGTAGT ACAACCAGCC CAAACCAGCA ACGCGCGTTA TCCAGCTTTT
GCGCTTGATC CAAACAGCCT TAAATACGAT GCCAGCACAA TGGCTCCGAT GGGTTGTTTG
CCCAGCGATA TCAACGAAGA TGGCATGCTC GATTTGATTG TGTATTACTG GGGGCGCACG
CCAATTATCT TTGTCCAACA ATATCAGGGA GCAAATGTCG ATCTCAGCAG CCAAAGCTAT
GTCGCCCAAG AGCTTGTAAC CACAGGCGAG CGCTGGTTTA CCAACACTGG CTTGGTCAGC
GATTTCGACG GCGATGGCCA TCAGGATTTG CTGTTTGCCA ATTATTTTCC TGATGGTGCG
GCGATTCTCG ATGCCAAATC CAGCCGCAAG CAAACCATGC AAGCCTCGAT GTCACGTGCA
TTCAACGGCG GCGATAAGCA CTTTTTCCTT TGGCAGCAAA CCAGCACCGA ACAAGCGCCA
TTTGTGGCTG TGCCCGATGT GCTGAGCGGC GAGTTGAACC ACGGCTGGAC GTTAGCACTA
GGAACCTACG ATTTCAACAA TGATCTGTTG CCAGAACTCT ACATTGGCAA CGACTTCGGC
CCCGATCGTT TTTTAATCAA CCGCTCAACG CCTGGCACGA TCAAATTAGA ACTGGCTGAG
GGCAGCGGGG GCTTCACGAT TCCCACATCC AAAGTGATTG GACACGATTC GTTCAAGGGC
ATGGGCGTTG ATTTCAGCGA TATTAATAGC GATCAGCACC CCGATATTTT TGTGAGTAAC
ATCACCACAC CGTTTGGTTT GCACGAAAGC AATTTTGCCT ATGTCAGCGA TCCCAGCGCC
AAACTCGATC AAAGTGAATT GCCCAATTAC ACCGACCAAA GCGAGCAACT AGGCTTTGGG
CGCAGTGGCT GGGCTTGGGA TATTAAGCTA GCCGATTTCA ATAACGACCA GCGCGACGAA
ATTTTGCAAG CAACCGGCTT TGTCAAAGGC ACAATCAATC GTTGGCCCGA ACTGCAAGAA
TTGGCAACAG GCAACGATCA ATTGCTCGCC GACCCCGCCA GTTGGCCGCG CTTCAGTGCT
GGCGATGATA TCGCGGGCCA TCAAATTAAC CCATTTTTTA GCCAAGCCGC CGATGGCCGC
TACTACGACC TTGCCAAAAC CCTAGGCTTT GCGCCAACCG TCAGCCGTGG CATTGCAGTT
GGCGATGTTG ATGGCGATGG TAAGCTTGAT TTTGCCTCAG CCAACCAATG GGAAGATTCG
ATCTTCTACC ATAACACCAG CCAAAGCAGC AATCAAGCGC TTGGTTTACG CCTGCGGATC
GCCAGCGATG GCAAGGCCAG CAATATTACA GGCTTTCAAC CAACCAGCGC GGCGATTCCG
GCAATTGGAG CGCATGTCAC GGTCAAATTG CCCGATGGTC GCACGGTTAG CAGCCAAGTT
GATGGCGGCA ATGGACATTC AGGCAAACGC AGCTACGATT TGCACTTTGG CTTAGGGGCA
CTTGATCCAC AAACCCAACT TGAAGTAACG GTGCGCTGGC GTGGCCGCGA TGGCAGTGTT
CAAACCAGCG TCGTCCAACT CACACCAGGC AACCATACCT TGATACTTGG CCAATAA
 
Protein sequence
MKQTLLKYQT QLLAVAVLVG TFIVAQPPTL SQAEQADLSQ SFGFAQQPLA ALAGHTQRFI 
RPVNPSLAHI DAWISSVGAA IALNDLDNDG LSNDICYVDT RIDQVVVQPA QTSNARYPAF
ALDPNSLKYD ASTMAPMGCL PSDINEDGML DLIVYYWGRT PIIFVQQYQG ANVDLSSQSY
VAQELVTTGE RWFTNTGLVS DFDGDGHQDL LFANYFPDGA AILDAKSSRK QTMQASMSRA
FNGGDKHFFL WQQTSTEQAP FVAVPDVLSG ELNHGWTLAL GTYDFNNDLL PELYIGNDFG
PDRFLINRST PGTIKLELAE GSGGFTIPTS KVIGHDSFKG MGVDFSDINS DQHPDIFVSN
ITTPFGLHES NFAYVSDPSA KLDQSELPNY TDQSEQLGFG RSGWAWDIKL ADFNNDQRDE
ILQATGFVKG TINRWPELQE LATGNDQLLA DPASWPRFSA GDDIAGHQIN PFFSQAADGR
YYDLAKTLGF APTVSRGIAV GDVDGDGKLD FASANQWEDS IFYHNTSQSS NQALGLRLRI
ASDGKASNIT GFQPTSAAIP AIGAHVTVKL PDGRTVSSQV DGGNGHSGKR SYDLHFGLGA
LDPQTQLEVT VRWRGRDGSV QTSVVQLTPG NHTLILGQ