Gene Haur_0966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0966 
Symbol 
ID5732852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1107370 
End bp1109499 
Gene Length2130 bp 
Protein Length709 aa 
Translation table11 
GC content52% 
IMG OID641278098 
Producthypothetical protein 
Protein accessionYP_001543742 
Protein GI159897495 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000682742 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAACCA AACGTTCGTT GATCGCTATC GGGTTGGCGC TCGCTATCGC CTTGTTGAGT 
GTGTTTGCGG CTAGTAGCCT CACACCATCC AGCGCGGCAT CGTCGTCCGA TGGGCTATGG
CAAGACGTTG CTGAACAACG GATTCAGCAA AAAGGCCAAC GTGATATCGT GCCGTTGGTC
TACCGCACGG TCAGCCTCGA CCTCGCAGGT TTGAGCCAGC GTTTAGATCA AGCACCCTTG
GAAAGTGCTG TGCAGGTACA GCAATCAGCA TTTTTGTTGA GCTTACCGCT GCCAAGCGGC
CAATTTCGCC AATTTCGGGT GGTCGAATCG CCAATTATGG AGCCTGCGCT GGCGGCAAAG
TTCCCTGAAT TGCGCACCTA CTTGGCACAA GGCTACGACG ACCCCGAAAT GGTTGCACGG
CTTGATCTTA CGCCAAGCGG CTTTCATGGT TTGATCTTGG CTCCGGAAGG GCGCTATTTT
ATCGACCCCT ACAGCCGCAA CGATACTGGC AATTATATTG TCTATGATAG CCGTAATTTT
GTGGCCGACC CCAGCAAACT CGCCAGCAAA GGCAAGACCG ATTATGTTGG CGAAACTCCC
ATCACCAACC CATTCCCTGA GCGCTATAGC ATTGGCGAAA CCTTGCGCAC CTATCGCCTC
GCCATGGCTG CTACAGGCGA ATACACCAGT TTTCATGGTG GCACGGTCAA TGGAGCAATG
GCAGCAATCG TAACTAGCGT CAATCGCGTT AATACCGTTT ACGAACGCGA TATCTCGGTA
CGTATGGTGT TGGTTGCCAA CAATAACTTA ATTGTGTATA CCAATGGCGG CACTGACCCC
TACACCAACG ACGATGGCTT TGAGATGCTG GGCGAGAATC AAACTAACCT CACCAGCGTG
ATTGGTAACG CCAATTATGA TATTGGTCAC GTATTCAGCA CTGGTGGTGG CGGGGTTGCC
GCACTTGGCT CAGTCTGTGT CTCAGGCTCA AAAGCTGAAG GTGTGACGGG TTCACCAGCT
CCGGTTGGCG ATCCTTTTGA CATTGATTAT GTCGCCCACG AAGTTGGTCA CCAATTCGCA
GGTAACCACA CCTTCAACGG TACAACTAAC GCCTGTGGCG GTGGCAATCG TGAAGGCCCA
GCCGCCTACG AACCAGGCAG CGGCTCAACC ATTATGGCCT ATGCTGGGAT TTGTGGCTCG
GAAAATCTGC AACCCAACAG TGATCCATAT TTCCATGTGA AAAGCTTGGA AGAAATGAGC
GCCTTTATTA CAACTGGTGC TGGCGCAAGC TGTGGTACCA CGGCGGCCAC TGGCAACACG
CCACCAACCG CTAACGCTGG CGCAGATTTC ACGATTCCTG CCAATACGCC GTTTGAATTA
ACTGGCAGCG GCAACGATGT GAACGGCGAT AGCCTGACTT ACAATTGGGA GCAATACGAT
TTAGGTTCAG CATCGCCACC GAATACTGAT AACGGCAATC GCCCAATTTT CCGTAGTTTC
GATTCAACCA CTTCGACCAG CCGTAGCTTC CCACGCTTGA CCAATATTTT GAACAACTCG
ACGACGATTG GTGAATCGAT GGCAACGACC AATCGCACCA TGAATTTCCG CCTAACCGTC
CGTGATAATC GGGCTGGTGG CGGTGGTTAT GGCTTGGATA CAGCACGAGT TACAACCGTC
AATACTGCTG GCCCCTTCCA AGTAACCGCG CCAAACACCG CCGTAACCTG GGCTGGCTTC
AGCAGCCAAA GCGTTACTTG GAATGTTGCC AACACGACTG CTGCACCAGT CAATTGTAGC
AATGTCAATA TTTTGTTCTC AAGCAATGGT GGTACGAGCT TTAGCCCAGT GCTGAGCAAC
ACGCCTAATG ATGGCAGCGA GAGCATCACC GTACCAAACG TTGCTACCAC AACTGGTCGG
ATCAAAGTGC AATGTGCTGG CAATGTTTTC TTTGATATTG GCAACGCCAA CTTCACGGTA
ACCGCCAGCA ATGCCACGGT CACGCCAACC AGCGATGCCA CGGCAACCCC AACCATTACG
CCAACGGCAA CCGCAACGGT TACCCCAAGC GTAACAGCTA CCCCAAGTAC ATCAATGGTC
TACTTGCCAG TAGCCATGAA ACAACCCTAA
 
Protein sequence
MQTKRSLIAI GLALAIALLS VFAASSLTPS SAASSSDGLW QDVAEQRIQQ KGQRDIVPLV 
YRTVSLDLAG LSQRLDQAPL ESAVQVQQSA FLLSLPLPSG QFRQFRVVES PIMEPALAAK
FPELRTYLAQ GYDDPEMVAR LDLTPSGFHG LILAPEGRYF IDPYSRNDTG NYIVYDSRNF
VADPSKLASK GKTDYVGETP ITNPFPERYS IGETLRTYRL AMAATGEYTS FHGGTVNGAM
AAIVTSVNRV NTVYERDISV RMVLVANNNL IVYTNGGTDP YTNDDGFEML GENQTNLTSV
IGNANYDIGH VFSTGGGGVA ALGSVCVSGS KAEGVTGSPA PVGDPFDIDY VAHEVGHQFA
GNHTFNGTTN ACGGGNREGP AAYEPGSGST IMAYAGICGS ENLQPNSDPY FHVKSLEEMS
AFITTGAGAS CGTTAATGNT PPTANAGADF TIPANTPFEL TGSGNDVNGD SLTYNWEQYD
LGSASPPNTD NGNRPIFRSF DSTTSTSRSF PRLTNILNNS TTIGESMATT NRTMNFRLTV
RDNRAGGGGY GLDTARVTTV NTAGPFQVTA PNTAVTWAGF SSQSVTWNVA NTTAAPVNCS
NVNILFSSNG GTSFSPVLSN TPNDGSESIT VPNVATTTGR IKVQCAGNVF FDIGNANFTV
TASNATVTPT SDATATPTIT PTATATVTPS VTATPSTSMV YLPVAMKQP