Gene Information |
Locus tag | Haur_4363 |
Symbol | |
ID | 5736223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5573524 |
End bp | 5575425 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281524 |
Product | cell wall anchor domain-containing protein |
Protein accession | YP_001547123 |
Protein GI | 159900876 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases |
TIGRFAM ID | |
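The start, end, and length fields above agree with each other if the coordinates are read as 1-based and inclusive. The record does not state that convention, so it is an assumption here. A minimal sketch of the arithmetic follows; the helper names `gene_length` and `protein_length` are hypothetical and not part of any database API:

```python
def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(5573524, 5575425)
print(length, protein_length(length))  # 1902 633
```

Both results match the Gene Length and Protein Length rows, which is consistent with the "Is gene spliced: No" entry.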
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTTTGC TACGTGCGTT CGCGTTGCTG CTATTGCTGG CAGTTTTGGC TCCACTCGGC TTGCAGAGTG GGGTTGTAAG TGCCCAAAAT GAGCAACGCA CGGTCACTAT TTTGGAAACG AGCGATATCC ACGGCAATTT GATGGCTTGG GATTATTACG CCAACAAGCC TGCCGAATGG GGTATGACCA AGGTTGCAAG CTTGATCAAA CAAGAACGGG CGATCGATCC CAATCTCTTG TTGGTCGATA ATGGTGATAC GATTCAAGGT ACGCCCTTGA CCTACTACTA CAACGTGATC GACCAAAATG CCGCGCATCC AATGGCAGCG GTATTTAATG CGCTCAAATA TGATGTTTCA TCGTTGGGCA ACCACGAATT TAATTATGGG ATGGATGTGC TGAATCGCTA TATCAGCCAA GCTCAATACC CAGTTATGAG CGCCAATGTG CGCAAAAGCG ATGGTAGCGA AGCCTTCAAG CCCTACATTA TTAAAGATGT GAATGGCGTG AAAGTAGGCT TTTTGGCCTT GACCACGCCA ACTGTGCCAA CCTGGGAAAA ACCCGCCAAC ATCGCGGGCC TGCAATTTGC CGATCCGGTA GAAGTTGCCA AGCAATATGT ACCGCAAATT CGGGCTGAAG GTGCGCATAT TGTGGTTTTG CTGCAACACA CGGGCTGGGA AAAACAGCCT GCCGAAGCGA CCAAACCCGA AGCATGGCTA ACCGACCCCA GCACTTGGCG CGATACTGGC TCGTTGCCAG GCGAAAATGT GTCGATCAAA CTTGCCCAAG AAGTGCCTGG CGTTGACGTG ATTTTGACTG GTCACTCACA CTTGAGCGTG CCCAAGGCGA TTATCAACAA TGTGTTATTG ATCGAGCCAT CGTATTGGGG CCGCGCTTTG GGCAAAGTGA CGATTACGGT TGAGAAAAAT GGCGATAGCT GGAATGTGGT TAACAAAGAT TCAACCAACA TTTCAGTCAC CAATGTTGCC GAAGATCAAG AGATTAAAGC ACTGGTACAA CCATATCACG ACCAAACCTT GAGCTATATT AGCCAACCAG TTGGTACGGC TAGCGCCGAA TTTGCTGGCG GCCCCAAGGC GCGTTATCGT GATAGCGCTT TGGCCGATTT GATCAACAAT GTGCAAAAGC AAGCCGCTGC TGATGCTGGC TACCCCGTTG ATCTCTCGTT GGCAGCGATT TTCACCGATG GCGGCATGAT TCCGGCGGGC CAAATTACCC TGCGCGATGC CTACAGCATC TATATTTACG ATAATACGCT GTATGTGATG GAAATTAATG GTGATATTCT GCGCCGTGCT TTGGAGCGTA ACGCCGAATA TTTCCGCCAG CTTGATCCCA ATGCCTTGCC CAGCGATCCC AAAGCGGTAG TCAACGATAA TGCCCGCGAT TACAACTGGG ATTTATACAC CGATATCGAC TATAGCTACG ATTTGACCAA GCCAGCCGGC CAGCGTGTGA CCAAATTGCA ATTGAATGGG GTTGATATTA CACCTGAACA AACCCTGCGC ATCGCGATCA ACAATTACCG AGCTGGCGGC GGCGGTGGCT TTGCCATGTT CCGTGAAGGC AAAATTGTCT ATCAATCGAC CAGCGAAATT CGCGATTTGA TCGCTGAGTC AGTCAAAAAT GCTGGCACAA TTGATCCGAC GGTGGTGAAT AAGGTTAATT TTACCCTTGT GCCAGATTTA TATGCCCACT ATTTTGGTGC TGCCAGCCAG CCGACTGCTA CGCCAGTGCC AGCCCAACCA ACTGCCACTC CAGCGCCAGG TGTGCCAATT 
ACCTTGCCTG ATACCAGTGG TAACCAACCA AGCTATGCCT GGGTTTGGGC GGCTGTCGCC ATGGCCTTAC TCGCTTTAGG TTTGGTTGTG CGCCGCAATT AA
|
Protein sequence | MRLLRAFALL LLLAVLAPLG LQSGVVSAQN EQRTVTILET SDIHGNLMAW DYYANKPAEW GMTKVASLIK QERAIDPNLL LVDNGDTIQG TPLTYYYNVI DQNAAHPMAA VFNALKYDVS SLGNHEFNYG MDVLNRYISQ AQYPVMSANV RKSDGSEAFK PYIIKDVNGV KVGFLALTTP TVPTWEKPAN IAGLQFADPV EVAKQYVPQI RAEGAHIVVL LQHTGWEKQP AEATKPEAWL TDPSTWRDTG SLPGENVSIK LAQEVPGVDV ILTGHSHLSV PKAIINNVLL IEPSYWGRAL GKVTITVEKN GDSWNVVNKD STNISVTNVA EDQEIKALVQ PYHDQTLSYI SQPVGTASAE FAGGPKARYR DSALADLINN VQKQAAADAG YPVDLSLAAI FTDGGMIPAG QITLRDAYSI YIYDNTLYVM EINGDILRRA LERNAEYFRQ LDPNALPSDP KAVVNDNARD YNWDLYTDID YSYDLTKPAG QRVTKLQLNG VDITPEQTLR IAINNYRAGG GGGFAMFREG KIVYQSTSEI RDLIAESVKN AGTIDPTVVN KVNFTLVPDL YAHYFGAASQ PTATPVPAQP TATPAPGVPI TLPDTSGNQP SYAWVWAAVA MALLALGLVV RRN
|
| |
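As a quick sanity check on the two sequences, the first and last few codons of the gene sequence can be translated and compared with the ends of the protein sequence. This is only a sketch: the `CODONS` dictionary is a hand-picked subset covering the codons in these two excerpts (translation table 11 uses the standard codon assignments for them), and `translate` is a hypothetical helper, not a library function.

```python
# Subset of the genetic code: only the codons that occur in the excerpts below.
CODONS = {
    "ATG": "M", "CGT": "R", "TTG": "L", "CTA": "L", "GCG": "A",
    "TTC": "F", "CTG": "L", "CGC": "R", "AAT": "N", "TAA": "*",
}


def translate(dna):
    """Translate a DNA string codon by codon; spaces are ignored, '*' marks a stop."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))


first_30 = "ATGCGTTTGC TACGTGCGTT CGCGTTGCTG"  # first 30 bp of the gene sequence
last_12 = "CGCCGCAATT AA"                      # last 12 bp of the gene sequence

print(translate(first_30))  # MRLLRAFALL
print(translate(last_12))   # RRN*
```

The outputs match the first ten residues (MRLLRAFALL) and the last three residues (RRN) of the protein sequence, followed by the TAA stop codon.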