Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2193 |
Symbol | |
ID | 5734080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2779227 |
End bp | 2781599 |
Gene Length | 2373 bp |
Protein Length | 790 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279334 |
Product | carbohydrate-binding family V/XII protein |
Protein accession | YP_001544961 |
Protein GI | 159898714 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00201516 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACGTT TTTTTTCATT GATGTTGGTA TTTAGTTTAT TTGCGAGTCT CTTATTCAAC CAAGCTACTC ACGCCGCACC AAACCCATGG GCAGCCAATA CGGCCTATGC GGTTGGCAGC CAAGTTAGTT ACAATGGCAC GATCTATGAG TGTTTACAAG CCCACACGGC CTTAGTTGGT TGGGAGCCAG CCACCACACC CGCGCTCTGG AAGCAGGTTA GTACTGCTCC AAGCGCTACG ACAATTCCCG CAACAAGCAT TCCGCCAACC GCTCCCCCTG CCACAAATAT TCCGCCAACT GCAACCGCAA CGCCGACGGC TGGCTGTTTA AATCCAGCGG CGGGCGTTGG TTGGCAAAGC CGCAGTTTAA GCAACCAAAC TGGGAATTTT AGTTTGAGCT TTGAGGCAAC GCTGAGCGCT AGCCCCACCA ATGCTGTGAT TGGCTTGGCC AATGCTGTGC CAACCGATTA CACTGGTTTA GCAATTGCTG TGCGCTTTAA TCCAACTGGC TCAATCGATG CCCGCAACGG CGGAACTTAT GCTGCTGCTA CCAATGTGGC CTACAGTGCT AATACACGCT ATCGTTTTCG GATTGTTATC AATCTTGTGG CTCATACCTA CAGTGTTTAT GTTACGCCGC AGACTACTAG CGAAGTTTTG ATTGCCAGTA ATTTTGCTTT TCGCAGCGAA CAAGCCAATG TGAGCAATTT AAACACCTTG GCGACGGTGG TTGGTGGCAC AGGAGCCGTT GGCTCACTCA ATCTTTGCAA TATCAGCCTG AATCTGCCTG CAACTCCAAC GCCGCTGCCA ACCGCAACGC CAACCCCACG ACCAACCGCA ACCCCAGTGC CAACCGCAAC TCCAGCGCCA ACCGGTTATA CCCCAGCTTT TTGTGCCAAC TATCCACCAG CAATCGTCGC AGGCAGTTGG CAATCATCGG TGGTCAGCTA TCACAATGGG CGTTTGCAAT ATACCAACGA TAGTGCCCAA AACCGAATTC CTGATTTTAG CTATGCTGGC TATTATTCAG GCCAACGGCC ACTGCCAAAT CTGGCGGTTG TCCAAACACT CAGCCCAATC AGTGGCGATA ATACCGCCCG CATTCAACAG GCACTCGATG CAATTGGCAA TCGCACGCCC GATGCCAATG GTTTGCGTGG AGCATTATTG CTTGCACCTG GCCGCTACAA CATCAACGGA ACCTTACGCA TCAACAAAAG TGGCGTGGTG CTGCGCGGCA GTGGCGATGG CAGCGATGCT AGCACTTCCA CGATTTTGCT AGGAGTTGGC AACACGCCGC ATCAACGCAC ATTAATTGTG GTGGGCAACG GCGATTCAAC CCCGTGGACG GCTGGCTCCG CCACCAACGT GACCGACCAA TTTGTACAAG TTGGCAGCAA AAGCTTGAAT GTAGCCGATC CCAGTCGTTT TACGGTTGGC CAAGAAGTGA TTGTGCGCCA CCCATCATCA CAAGCATGGA TCAACGCGGT CAATGGTGGC GGGGTAGTCA ATGACGCTTG GTGGGCGGTC GGTGCTTTAG ATATGACTTG GACGCGCCGA GTAACTAAGA TTGCTGGTAC AACCCTGACG CTTGATGCTC CAATTTTCAA TCATTTGGAT CGGGCGTTGA GCCAAGCGAC GGTTGCTCCG GTTGCCAGCC GCACTATCAT CGCCAATGCT GGGGTAGAAA ATCTGCGGGT TGATATTCAG ACCGCTGGCG GCGAGGATGA GAACCACGTT TGGGATGCAA TTGGGATTGT CGGGGCAGAA AATAGCTGGG TCAAAAATGC GACAGTCTTG CACTTTGGCC ATGCTGGGGT GTTTACTCAA GGCGCAATTC GCATCACGGT TGAAGATGTG CAGGCACTTG ATCCAGTTGG CATTCGGACT GGTGGCCGTT TTTACAACTT CGATGCCGAA TCGAATAGCC AACTCGTGTT GTTTACGCGG GTTCATGCCA CTGGCGGTCG CCACAACTTT ATTTCCAATG GAACTCAAAC CACCTCGGGG ATTGTTTGGC ATCGTTCGAC TGAAGGCGGC GGCTCGGATA GCGAAGGCCA TCGTCAATGG AGCCAAGGTC TATTGTTCGA CACCATTAAT GCTAGTGCCG CCAGCAATAT CAAGCTGATC AACCGTGGCG ATTATGGCAC ATCGCATGGC TGGGGCAATG TGCATTCAGT CATCTGGAAC TACAATCGCA CGATGATGGT GCAAAAGCCG CCAACTGGCC AAAACTATGT CATCTCACAG GCTGGCACGC GTAGCACCTC GTATCCCTTC CCGGGCGCTG GTGGTTTTGC CGATATTCGC AGCGGCAGTT TAGTACCCAA TTCGCTCTAC GAAGCTCAAC TTTGTGATCG GCTAGAACAG TGA
|
Protein sequence | MQRFFSLMLV FSLFASLLFN QATHAAPNPW AANTAYAVGS QVSYNGTIYE CLQAHTALVG WEPATTPALW KQVSTAPSAT TIPATSIPPT APPATNIPPT ATATPTAGCL NPAAGVGWQS RSLSNQTGNF SLSFEATLSA SPTNAVIGLA NAVPTDYTGL AIAVRFNPTG SIDARNGGTY AAATNVAYSA NTRYRFRIVI NLVAHTYSVY VTPQTTSEVL IASNFAFRSE QANVSNLNTL ATVVGGTGAV GSLNLCNISL NLPATPTPLP TATPTPRPTA TPVPTATPAP TGYTPAFCAN YPPAIVAGSW QSSVVSYHNG RLQYTNDSAQ NRIPDFSYAG YYSGQRPLPN LAVVQTLSPI SGDNTARIQQ ALDAIGNRTP DANGLRGALL LAPGRYNING TLRINKSGVV LRGSGDGSDA STSTILLGVG NTPHQRTLIV VGNGDSTPWT AGSATNVTDQ FVQVGSKSLN VADPSRFTVG QEVIVRHPSS QAWINAVNGG GVVNDAWWAV GALDMTWTRR VTKIAGTTLT LDAPIFNHLD RALSQATVAP VASRTIIANA GVENLRVDIQ TAGGEDENHV WDAIGIVGAE NSWVKNATVL HFGHAGVFTQ GAIRITVEDV QALDPVGIRT GGRFYNFDAE SNSQLVLFTR VHATGGRHNF ISNGTQTTSG IVWHRSTEGG GSDSEGHRQW SQGLLFDTIN ASAASNIKLI NRGDYGTSHG WGNVHSVIWN YNRTMMVQKP PTGQNYVISQ AGTRSTSYPF PGAGGFADIR SGSLVPNSLY EAQLCDRLEQ
|
| |