Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2192 |
Symbol | |
ID | 5734079 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2776447 |
End bp | 2778759 |
Gene Length | 2313 bp |
Protein Length | 770 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641279333 |
Product | hypothetical protein |
Protein accession | YP_001544960 |
Protein GI | 159898713 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000184948 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTCGTA AACGCTATCT TATGCTAATC GCCCTCGTTT TAGCCAGCAC ATGGCTGCTA CCGGTTGCTG CCCAAGATCA AACCAAGCAG CAACCATCGG CAGTTTTAGC ACCTTCAGGC GAAACAATTA CCTACCAAGG CATGTTACGC GAAAGTGGCA GCTTGGCAAA TGGTGTGTTC GATTTTCAAT TTGGTTTGTA CGTTGACGCA GTCGGTGGCA CAGCTTTAGG CGTAGTCACC CGCAATGATG TGACTGTGAG CAATGGTTTA TTTACTAGCG AACTCACCTT TAGCGAAGGC TTGTTCAATG GTGAAAGTCG CTGGATCGAG CTAGCAGTTA AGGCCGATGC TAGCGGCAGC TATACCACGC TCACGCCCCG CCAACAAGTG ACAGCCGCGC CCTTGGCTTT AGCCTTGCCA GGCTTCTGGA CACGCCAAAA CAGCTTTAGC CCCAACTTAA TTGGCGGCTA CGAAGGCAAT ACTGCCTCAT CGTTGGCGAT TGGGATCACA ATTAATGGTG GTGGCAACTC AGGTGGCCTG AATAGCGCCT ACGATAACTA TAGTTCAATT GGTGGTGGGG CTGGCAATAG CGTCGGCAGC AACGATGGTA GCCCAAGCAA CGATGTCTAT TCAACCATCG GTGGGGGCAT CAACAATACC GCCAGCCAAG AATACATTGT GATTGGCGGT GGTCAAACCA ACAACGTCAG CGGCGCTTGG TCAACGATCG GTGGTGGCAT CAACAATGTG ATTAATAATA GTCGCTATAG TGTAATTGCT GGTGGTGGTG GCACAACTGG CAACACAATT TACGATGATT ATGGCACAAT TGCTGGGGGT AGCGAAAATA TCGCAGGTTT AACTGGCGAG GCCGTCAGCC AAATGTATGC CACTGTTGGC GGTGGTCGGG CCAATGTTGC CAGCGATAAC TATGCGATCG TCAGTGGTGG CCGCAGCAAC ACTGCCAGCG ATGATTACTC AACAGTTGCT GGTGGCTACA ACAACAGTGC TGCTGCTCAA TATGCCACAA TCAGCGGCGG TGGCAATTCC AACAGTGCCC AAGCCAATCG TGTTTACGAC GATTATGGCA CAATCGGCGG TGGTACAAAT AACGTAGCGG GCGTAACTGG CGATACAACT GGTCAGCAAT ATGCCACGGT TGGTGGTGGC AATGGCAATA CCGCCAGCGA AGATGGTGCG ACGGTTGCTG GCGGCAGTGG CAATAGCGCT AGCGCCAACT ACACCACAGT TGCTGGTGGC ACTACTAATG CTGCCAGTGG CGATACCAGC AGCATCGGTG GTGGCCAACT GAATGCAGCC AGCGGTGGCT ATTCAGTAGT TGCTGGTGGA CGCGGCAATA CTGCTTCTTC CAATATCGCC GGAGTTGGTG GTGGCCAATC GAATCAAGCG ACCAACACTG GCGCATATGT CGGGGGCGGC CAGACCAATA CAGCGACTGG CCAATATTCG GTCGTGGCTG GTGGTGTCAA TAATGATGCC ACTAATACCT ACGCTTCCGT AGTTGGTGGT ATCAACAATC AAGCGGCAGG TGCGGGTACA TTTGTCGGCG GTGGGCAAAA CAATAATGCC AACAGCACCT TATCGGCAAT TCTGGGTGGT AGCGGCAATA CAACCTTGGC CGATTATACC GTTGCCGCTG GCGAAAATGC CGTCGCTGCT CACGCTGGGA GTTTTGTTTG GGCTGGTCAA CAAGCCAACA AAGATGATAG TATTTCGACG ACTGGGCCAG GTCAGTTTAT CGTGCGTGCT CCAGGTGGGG CGTGGTTTGG CAGCAGCACC AAGGTCGATA TGCCTGATGG CGCGATTTTG GCAACTGAAA GTGGCGCTTT CCTGAGCAAA GGCGGTACAT GGGCCAATTC ATCAGATAAA AATCTCAAAT CAAATTTCGC CACGATCGAT CCCCAAGCTG TGCTCGATCA GCTTGCCAGT ATTCCCGTAC AAGCTTGGAG CTACAACAGC GAAGGTGCAG CAGTTCGCCA TATTGGCCCA ACCGCCCAAG ATTTTTATGC CGCCTTTGGT TTAGGCACTG ATGATCGCCA CATTGCCACG GTCGATGCTG ATGGCGTGGC CCTCGCTGGA GTCCAAGGTT TATACAATTT AGCCACCGAA CAAGCTCAAT TACTCGATCA ACAAGCCGAG CACATGGCGG CACTTGATGC ACGACTCGCA GCCTTGGAAC ATAGCCAAAA TCCTCAAACC AGTCTCCCAT GGTTGTGGTT GATCGCGATT GCCGCAGTTG GATTGGGCTT GGGGTGGATG CTTGGTCGTC GCAGCAAGGG GCAACGCGCA TGA
|
Protein sequence | MIRKRYLMLI ALVLASTWLL PVAAQDQTKQ QPSAVLAPSG ETITYQGMLR ESGSLANGVF DFQFGLYVDA VGGTALGVVT RNDVTVSNGL FTSELTFSEG LFNGESRWIE LAVKADASGS YTTLTPRQQV TAAPLALALP GFWTRQNSFS PNLIGGYEGN TASSLAIGIT INGGGNSGGL NSAYDNYSSI GGGAGNSVGS NDGSPSNDVY STIGGGINNT ASQEYIVIGG GQTNNVSGAW STIGGGINNV INNSRYSVIA GGGGTTGNTI YDDYGTIAGG SENIAGLTGE AVSQMYATVG GGRANVASDN YAIVSGGRSN TASDDYSTVA GGYNNSAAAQ YATISGGGNS NSAQANRVYD DYGTIGGGTN NVAGVTGDTT GQQYATVGGG NGNTASEDGA TVAGGSGNSA SANYTTVAGG TTNAASGDTS SIGGGQLNAA SGGYSVVAGG RGNTASSNIA GVGGGQSNQA TNTGAYVGGG QTNTATGQYS VVAGGVNNDA TNTYASVVGG INNQAAGAGT FVGGGQNNNA NSTLSAILGG SGNTTLADYT VAAGENAVAA HAGSFVWAGQ QANKDDSIST TGPGQFIVRA PGGAWFGSST KVDMPDGAIL ATESGAFLSK GGTWANSSDK NLKSNFATID PQAVLDQLAS IPVQAWSYNS EGAAVRHIGP TAQDFYAAFG LGTDDRHIAT VDADGVALAG VQGLYNLATE QAQLLDQQAE HMAALDARLA ALEHSQNPQT SLPWLWLIAI AAVGLGLGWM LGRRSKGQRA
|
| |