Gene Haur_1249 details

Gene Information

Locus tag: Haur_1249
Symbol:
ID: 5733127
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1457794
End bp: 1459932
Gene Length: 2139 bp
Protein Length: 712 aa
Translation table: 11
GC content: 50%
IMG OID: 641278389
Product: PKD domain-containing protein
Protein accession: YP_001544025
Protein GI: 159897778
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:
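
The coordinates and lengths in the table above can be cross-checked against each other. The Python sketch below is one way to do that. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; both assumptions fit the listed numbers but are not stated in the record. The function and variable names are illustrative only and do not belong to any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information table (variable names are hypothetical).
START_BP, END_BP = 1457794, 1459932
GENE_LENGTH_BP, PROTEIN_LENGTH_AA = 2139, 712

if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both comparisons hold for this record: the span 1457794..1459932 covers 2139 bp, which is 713 codons, or 712 residues plus the stop codon.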


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.8229
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGCGTT GGATTATTAT CGTTGTTCTG ATCTTTTCAG GTTTTTCTGC CCAATTTGTT 
CAACAATCGA ACCCTGAGCT TTCCCCTGCT GCTGTTCCTG CTGGCTTCAC CCAAACTGTG
GTTGTACCCT CGAATAGCCT TGATGCTCCA ACTGCCTTTA CTTGGTTGCC CTCTGGTGAG
ATGTTGATTA CTCAGCAAAA TGGTCAGTTG TTGGGTTGGA ATGGCACAAG CACGCGCACC
GTAATGAGCC TTGGCAATCG AGTTTGTTAC GATTTTGAGC GTGGTTTGCT GGGCATCGCG
GTTGATCCCC AATTTACCAG TGGTCGTCCA TATGTCTATG TCTACTATAC CTTTAATAAA
TTTAACCAAA CTTCGAACAA TTGCCCTCGC CAAAGCCCCA GCACCAATCC GGTCAATCGA
GTTTCACGCT TTACTTGGAG TAATAATGTG CTCGATATCA ACTCGGAATC GGTCTTGATC
GACAATATTG GCTCATATAA CGGCAATCAC AATGCTGGCG ATCTTGGCTT TGGCAAAGAT
GGCAAGTTGT ATATCAGCGT TGGCGATGGC GGTTGCGATT ATCTCGATAG TGGCTGTGGT
GGCGCAAACG ATGCCTCGCG CGAACAGCAC ACCTTGCTTG GCAAAATTTT ACGCATCAAT
GCTGATGGTA CGATTCCTAG CGATAATCCG TTTACTGGCA GCGGCACGGC CCGTTGTAAT
ACTGGCTCAG TGGCTAGCGG CACGATTTGC CAAGAAACTT GGGCTTGGGG TTTTCGCAAT
CCCTACCGTA TAACCTTCGA TCCCAATGCT AGCGGCGTGC GCCTGTTTGT CAACGATGTT
GGCCAAAATG TGCGCGAAGA AATCGACGAA GTTGTGGCGG GCAAGGATTA TGGCTGGAAT
TGTCGCGAAG GTACGCGGGT CAATAATTCA ACGGGGCCAT GTTCGCCAAC GCCCGCCAAT
ATGGTTGACC CAATTTATGA ATATAGCCAT GGCAACGCTG GCGCACCATT TACCAACTGT
AATTCGATCA CTGGTGGCGC GTTTGTGCCT GCCAATACTT TTCCTAGCAA TTACAGTGGT
TATATGTTTG GCGATTATGT TTGCGGCAAG ATTTTTATGA TTTCAGCCCA AGCGCCCTAC
AATTCGGTTC TAACTTTCTC AGATGATCCT GGATCAGTCA CGCATATGGC GTTTGGCCCG
AATGGCGGTC GCCAAGCGTT ATTCTATGCG ACCTATGCTA ATGGTGGCGA GATTCGCCGA
ATTAGCTATG ATGGTAGCAC CAGCTTGAAT TCTTCGTTTA CAGCCAACCC CAGTTTTGGC
GCGGCTCCCT TGGCCGTAAC CTTTACCGCT AGCAATCCAA GCAGCGGCGC AAGCTATTTG
TGGAATTTTG GCAATGGCAC GAGCCGCGAA ACTAGCACGG CCAGCACATC CTACACTTAC
GCCAACAATG GCACCTACAC TGCAACCCTG TATTTGCGCG GCAGCAATGG CGATTTATCG
AATGTGAGCC AGGCGATTGT GCGGGTTGGC GCAACTGCGC CGAACGCCAG CATCACCCAA
CCAAACTCAA GTGCCCAATT TGCGGTTGGC CAAACAATCC AAGTGCGGGG GCAGGCCAGC
GATGCCGAAC AAGGCCAATT GCCAGCCAGC GGTTTATCGT GGAAAGTAAT TTTGCATCAC
GATACCCATA CCCACCCCTA TTTGACCCAA CCAACCACCA ATAGTTTTAG CTTTACTGCG
CCAGCCCCCG AAGATTTATT GGCGGCCAGC AACAGCTATT TGGAGCTTGA GCTGACTGCG
ACTGACGATA GTGGCTTGAG CCATGTTGTT ACCCAAACTA TCCAGCCCCA TAAAGTCAAT
GTGACCTTGG CCTCAACCCC GAATGCTAAC GCCAACTTTG TGGTCAACAA CGACCCGATC
GAAGCTGATG ATCCGTTTAT TTCGTGGGAA AACTACAGCC TACGCGTGAC TGCACCAGCC
TATGCTGATA GCAATCGTTG GTGGCGTTTT GTGCGTTGGA GCGATAACAA CACCAGCAAT
CCGCGCACCT TTACGACACC TGCCAGCGCT ACGACTTACA CCGCGGTCTA CGAAGAATTT
ATCCCCTATC AGCTCTATTT GCCAGTTGTG CGCAAGTAA
 
Protein sequence
MSRWIIIVVL IFSGFSAQFV QQSNPELSPA AVPAGFTQTV VVPSNSLDAP TAFTWLPSGE 
MLITQQNGQL LGWNGTSTRT VMSLGNRVCY DFERGLLGIA VDPQFTSGRP YVYVYYTFNK
FNQTSNNCPR QSPSTNPVNR VSRFTWSNNV LDINSESVLI DNIGSYNGNH NAGDLGFGKD
GKLYISVGDG GCDYLDSGCG GANDASREQH TLLGKILRIN ADGTIPSDNP FTGSGTARCN
TGSVASGTIC QETWAWGFRN PYRITFDPNA SGVRLFVNDV GQNVREEIDE VVAGKDYGWN
CREGTRVNNS TGPCSPTPAN MVDPIYEYSH GNAGAPFTNC NSITGGAFVP ANTFPSNYSG
YMFGDYVCGK IFMISAQAPY NSVLTFSDDP GSVTHMAFGP NGGRQALFYA TYANGGEIRR
ISYDGSTSLN SSFTANPSFG AAPLAVTFTA SNPSSGASYL WNFGNGTSRE TSTASTSYTY
ANNGTYTATL YLRGSNGDLS NVSQAIVRVG ATAPNASITQ PNSSAQFAVG QTIQVRGQAS
DAEQGQLPAS GLSWKVILHH DTHTHPYLTQ PTTNSFSFTA PAPEDLLAAS NSYLELELTA
TDDSGLSHVV TQTIQPHKVN VTLASTPNAN ANFVVNNDPI EADDPFISWE NYSLRVTAPA
YADSNRWWRF VRWSDNNTSN PRTFTTPASA TTYTAVYEEF IPYQLYLPVV RK
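
The gene and protein sequences above can be related with a short translation sketch. The Python below builds a codon lookup for translation table 11 from the amino-acid string in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...) and translates a whitespace-grouped DNA string. The helper name translate and the two example constants are illustrative, not part of the record; the sketch handles only unambiguous A/C/G/T input and raises KeyError on anything else.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for translation table 11, in NCBI codon order; '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 60 and last 9 nucleotides of the gene sequence listed above.
FIRST_60 = "ATGTCGCGTT GGATTATTAT CGTTGTTCTG ATCTTTTCAG GTTTTTCTGC CCAATTTGTT"
LAST_9 = "CGCAAGTAA"

if __name__ == "__main__":
    print(translate(FIRST_60))  # MSRWIIIVVLIFSGFSAQFV
    print(translate(LAST_9))    # RK*
```

The first 20 residues and the final RK match the listed protein sequence, and the trailing TAA stop codon accounts for the one-codon difference between 2139 bp and 712 aa.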