Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2981 |
Symbol | |
ID | 5734853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3762823 |
End bp | 3765396 |
Gene Length | 2574 bp |
Protein Length | 857 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280125 |
Product | Ig family protein |
Protein accession | YP_001545747 |
Protein GI | 159899500 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTTTTC GGCGAACAGC ATACGGGCTA GGATTAAGTT TATTGTTAGC GACCCTCCCC CACGCAAGCT TGGCAACCCC AGCGCTCACC ACAACTGGCG TTCCAACCGA CCCCACTCCG GCAATCAAAC TCACGCGAAT TGGCCGTTAT AACCCAGGCC CATTTCGTAG CGCTGATCCA CGGGCAGCTG AAATTGTCGA TTTTGATCCG CAAAGCCAGC GCATGGTCTT GATCAATGGC TTTAACAGCG CCTTGGATAT TGTTGATCTG AGCAATCCGG CCAACCCGCA GTTGCTTACA ACGATCGCCA TTACGCCCAC CAGCAGCAAT GTGCCCAACA GCGTGGCCGT GCACAATGGC TTAGTCGCGG TGGCCGCCAA TGCTGCCGTC AAAACCGATC CTGGGCGGGT GGTGTTGTTC AATCGCGATG GCGTGTTTTT GAATGAAATC ACAGTTGGAG CAGTGCCCGA TATGCTGACC TTCACGCCCG ATGGCCGCCG GATCGTGGTG GCAATTGAGG GCGAACCCAA CAGCTACAAC CAAGTTGATT CGGTTGATCC TGAGGGGGCG GTGGCGATTA TCGATTTGCC GCAAAATTTT GCCAACATTA CAACCACCAG CGTGCTTTCA TCAAGCTTGG TTGGCTTTAC TGATTTTAAT CTGGGTGGCA GTCGCCATGC TGAGCTTGAC CCGCAAATTC GAATTTTCGG GCCAAACGCC AGCGTCGCCC AAGATTTAGA GCCGGAATAT TTAACAATCT CTGCCGATTC GAGCAAAGCC TATGTGACGC TGCAAGAAAA TAATGGCTTG GCCTTGATCG ATCTGAATGC AGGGCGGGTG CAATGGCTCA AAGCTTTGGG CTATAAAAAT CACAATCTCG CGGGCTATGG GCTTGATCCC AGCGATAGCG ATGGCATGAA TGCAATTGCG CCATGGCCTG TGTTGGGTAT GTATCAGCCA GATACGATTA ATAGCTATGC TGCCAATAAC CAAACCTATT TGGTAACTGC CAATGAAGGT GATGCCCGCG ACTACACCGG ATTTACCGAA GAAGTGCGGA TCAAAAATGT GATGCTTGAT TCGAGCGTGT TTACCAACGC TGCCAGCCTG CAACAAGATG CCCAACTTGG GCGCTTGAAT ATCACCAATA CTAAGGGCAA CTTTGGCGGG CAGCACCATG CACTCTATTC ATTTGGCGCA CGCTCATTCT CAATTTGGGA TGGTACGACA GGTCAGTTGG TATTTGATAG TGGCGATGAT TTGGAAACCC GGACTGCCGC TACGTTTCCA AATAATTTTA ATGCCAATAA CACCGCCCAC AGCCGCGATA ACCGTAGCGA CGATAAAGGC CCAGAGCCAG AAGCTTTGGC GGTAGCGACG ATTGATGGCC GCAGCTATGC CTTTGTCGGC TTGGAGCGGA TGGGCGGAAT TATGGCCTAC GACGTGAGCA ACCCGCACGC GCCCCAATTT CTCGAATATT TCGCTGCGCG TAGCTTCCCC AGCAGCTATG TTACTGGCAC GCCCGATGAT CTTGGGCCTG AAGGCATGCA TGTGATCGCC GCCGAAGATA GCCCAACTGG CAAGCCCTTG TTGTTGGTTG CTAACGAAGT GAGCGGCTCG GTTTCGATCT ATCAAATTAG CGCCCAAACT CCTCGCATGC ACTTGAACCT GAGCGATGGC TTAACCAGCG TGCAACCCAA CACCTCGGTT ATTGCCTCGC TGAGCTTGAA TAATCAACAA ACTGAGCCAA GCGCTCGCCC GGCAACTGAA GTCCAAGTGC AGTATCTTGT GCCAAGCCAA TTAAGCTACA ACGGTTGTAC AATTGCCAGC CCCTTGGCGG GCACATGTAG CCAGCAAAAT GGCCTAGTAA CCTTCAATCT GACCACACCA TTTGCCTCGG CTAGCCAAGG CTTGTTGCAG GTTGCCACCA CGGTCAAGCC CAATGCCACA GGCACAATTG AGCATCAAGC CAGCCTCAGC TATCGCGATG CTGGCGAATT GCAAACCACG GTTCAAGTCA GCGATACGAC CACAATTGGC GTTGCACCGT TGATTACCAG TGGCTTGCCC ACGGCGGCGA GCTATGGCGC GATCTATAGC CACACCCTGA CGGCGAGCGG CATGCCAACC CCAACTCTCA ATCTTGTTGG CAACTTGCCA GCAGGCTTGA GCTTCGATAG CCAAACTGGA ATTTTGGCGG GTACGCCGAC CACCAGCGGT AGTTTCCCAA ATTTGATCTT CCAAGTGAGC AACGGAATTG GTACAATGGT AACGCAAAGC TTTACGCTGA CCGTTGCCAA AGCGCCATTG CAGGTCGTTG CTGATAACCA ACGTCGTTTA TTCGGCCAAC CCAACCCGCC CTTGAGCTAT CAAGTAACTG GCTTGCGCTT GCAAGATACG GCTGCAAGTG CATTAACTGG CACATTAACC ACCACCGCAA CCCTCACCAG CCCGCTTGGT GAGTATCCAA TTAGCCAAGG TAGTTTGCAG GCTCAACACT ACCAAATGAG CTTTAGCGCT GGCATACTCA CCATCGAAGC CAACGCGGTT TACCTACCCT TGATTGGGAA ATAA
|
Protein sequence | MRFRRTAYGL GLSLLLATLP HASLATPALT TTGVPTDPTP AIKLTRIGRY NPGPFRSADP RAAEIVDFDP QSQRMVLING FNSALDIVDL SNPANPQLLT TIAITPTSSN VPNSVAVHNG LVAVAANAAV KTDPGRVVLF NRDGVFLNEI TVGAVPDMLT FTPDGRRIVV AIEGEPNSYN QVDSVDPEGA VAIIDLPQNF ANITTTSVLS SSLVGFTDFN LGGSRHAELD PQIRIFGPNA SVAQDLEPEY LTISADSSKA YVTLQENNGL ALIDLNAGRV QWLKALGYKN HNLAGYGLDP SDSDGMNAIA PWPVLGMYQP DTINSYAANN QTYLVTANEG DARDYTGFTE EVRIKNVMLD SSVFTNAASL QQDAQLGRLN ITNTKGNFGG QHHALYSFGA RSFSIWDGTT GQLVFDSGDD LETRTAATFP NNFNANNTAH SRDNRSDDKG PEPEALAVAT IDGRSYAFVG LERMGGIMAY DVSNPHAPQF LEYFAARSFP SSYVTGTPDD LGPEGMHVIA AEDSPTGKPL LLVANEVSGS VSIYQISAQT PRMHLNLSDG LTSVQPNTSV IASLSLNNQQ TEPSARPATE VQVQYLVPSQ LSYNGCTIAS PLAGTCSQQN GLVTFNLTTP FASASQGLLQ VATTVKPNAT GTIEHQASLS YRDAGELQTT VQVSDTTTIG VAPLITSGLP TAASYGAIYS HTLTASGMPT PTLNLVGNLP AGLSFDSQTG ILAGTPTTSG SFPNLIFQVS NGIGTMVTQS FTLTVAKAPL QVVADNQRRL FGQPNPPLSY QVTGLRLQDT AASALTGTLT TTATLTSPLG EYPISQGSLQ AQHYQMSFSA GILTIEANAV YLPLIGK
|
| |