Gene Information |
Locus tag | Haur_1931 |
Symbol | |
ID | 5733820 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2339802 |
End bp | 2341856 |
Gene Length | 2055 bp |
Protein Length | 684 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641279075 |
Product | alpha beta-propellor repeat-containing integrin |
Protein accession | YP_001544702 |
Protein GI | 159898455 |
COG category | |
COG ID | |
TIGRFAM ID | |
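The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2339802, 2341856)
print(length)                     # 2055
print(protein_length_aa(length))  # 684
```

Both results match the Gene Length (2055 bp) and Protein Length (684 aa) rows.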
| |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0195615 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTACCC CAAAACCTAT CGTTTATGCC CGCATGCTGC TTGGATTGTT ATTGATTAGC ATAAGTTTTG GCCTTGGATT GCCGCGAATC GCCGCCCAAA CTCCCCGTGA CGAGGCAACG GAGCGCTTAA ACTCAGCCGA TTGGGCGGCA ATTCAAGGCC TGCTTGCACC AACTACGGCG ATCTCTGGCT CCAAATTTCA AGCTGGCTAC TTGAAGGCCG CCCAAGTTTC GGCTGGCGAT AGCTTTGGCG CGAGTGTGGC AATTTCTGGC GATACGGTGG TGGTCGGCGT TCCAGCCGAG TCAAGCAGTT TGGCAGGTGT GCAAAATAGT GCTACGCCGA CCATCAATGC TTTGGCTGCC CAAGCCGGTG CCGCCTATGT GTTTGTACGG GGGAGTGCTG GTTGGCAGCA ACAGGCCTAC CTCAAAGCCT CTGAGGTGAG CGCCAATGAT CAATTTGGTT GGAGTGTGGC GATCTCCGGC GATACGATTG TGGTTGGCTC ACCGAGAGAG AGCAGTAGTA CGGTTGGCGT GCAAAATAGT GCGACACCAA CCGTCAATAA TGATCTTGGT GGGGCTGGCG CGGCCTATGT GTTTGTGCGC ACTGGCTCAA CTTGGAGTCA ACAAGCGTAC TTCAAAGCCT CACAAGTTAC ATCTGGCGAT TGGTTTGGCT GGAGCGTGGG TTTAGCATCA AACACGATTG TGGTTGGAGC TTATGGCGAA GATAGCAATA TCGTTGGCGT GCAAAATAGT GCTACGCCGA CCGTCAATGA AGCGGCTACC GCAGCGGGTG CGGCCTATAT ATTTGTGCGC ACTGGCTCAA CTTGGAGCCA ACAAGCCTAC CTTAAAGCTG CCCAAATTAA TACTGATGAT GTGTTTGGGT GGAGTGTTGC AGTAGCTGGC GATACGGTGA TCGTTGGTGC GCCAGGTGAG GACACCAGCA TTGCAGGAGT GCAACATAGT GCTACGCCGA CGGTGGATGA AACAGCTCTC GCAGCAGGGG CAGTCTTTGT ATTTACCCGC AGTGGCTCAA GCTGGAGTCC GCAAGTCTTT GTGAAGGCCT CACAGGTTAC ACCTGGCGAT CAATTTGGCT ATAATCTGGC GATCGCTGGC AATACGATTG TGGTTGGCGC TCCCTACGAA GATTCAAGCA CCTCGGGCGT GCAGCATGGC GCTAGCCCAA GTGTCGATGA GCTAGCATCG TTCGCTGGAG CTGCCTTTGT TTATACCCAA AACGCGGGAG TATGGAGCCA ACAAGCCTAC CTCAAAGCCT CGAATGTGGC GGATGGTGAT CGCTTTGGCA TGAGTGTGGC GATTGATGCC AATACGATTG TGGTTGGTGC GCCGGAAGAG GATAGTGGCA TTGCTGGGGT GCAAAATAGT GCACTACCCA ACGCTGATGA GAGCGCCATC CAAGCAGGAG CCGCCTATGT GTATGCACGG AATGCTACAA CTTGGAGCCA GCAAAGTTAT CTAAAGGCTT CGCAGGTTTC GACTGGCGAT TATTTTGGGC GTGGGGTTGG GGTGGCGGGC GATATGCTGG TGATTGGGAT TCCGCTTGAA GATAGTGCTC GCAGTGGCAT CCAAAATAGT GCTACGCCGA GTGTTGATGA ACTTGCTGCT GATTCAGGTG CGGCCTTAAT TATTGATACA ACCTATCGTT CGTATAGCCC GTTGGTTCAG CGCATGACCC TCTTGGCCTT GCTTACGATC AATTCGACGG CTATTCCCAT TCGGGCGGTG ACGCAACAGG GCGAAGTATT TGCAAGCTTC ACGGCAACAT TGCCTACCAC GATTCCAGCT 
GGTGGCCATA TTTATCTTTC GGCTAGCCCT AGCTCGTTGC AACCAACCTT GGTCGATGAT CGGATTATCA TTCGCGATGG GGCTAACATA ATTTTTCAAC ATACCTACGA TCTTGAGAGC AATGGCGAGC TTGTTGAGAT TCCGTGGGAA GTTATTAATG CGGCTAGCGG CCATAGCCTG ACGATCACTT TTGTTGATGT CTCGGCAGGG TTGGTTGGGG CAACGCCGAT CTATCTGATT TGGGTGGCTG AGTAG
|
Protein sequence | MSTPKPIVYA RMLLGLLLIS ISFGLGLPRI AAQTPRDEAT ERLNSADWAA IQGLLAPTTA ISGSKFQAGY LKAAQVSAGD SFGASVAISG DTVVVGVPAE SSSLAGVQNS ATPTINALAA QAGAAYVFVR GSAGWQQQAY LKASEVSAND QFGWSVAISG DTIVVGSPRE SSSTVGVQNS ATPTVNNDLG GAGAAYVFVR TGSTWSQQAY FKASQVTSGD WFGWSVGLAS NTIVVGAYGE DSNIVGVQNS ATPTVNEAAT AAGAAYIFVR TGSTWSQQAY LKAAQINTDD VFGWSVAVAG DTVIVGAPGE DTSIAGVQHS ATPTVDETAL AAGAVFVFTR SGSSWSPQVF VKASQVTPGD QFGYNLAIAG NTIVVGAPYE DSSTSGVQHG ASPSVDELAS FAGAAFVYTQ NAGVWSQQAY LKASNVADGD RFGMSVAIDA NTIVVGAPEE DSGIAGVQNS ALPNADESAI QAGAAYVYAR NATTWSQQSY LKASQVSTGD YFGRGVGVAG DMLVIGIPLE DSARSGIQNS ATPSVDELAA DSGAALIIDT TYRSYSPLVQ RMTLLALLTI NSTAIPIRAV TQQGEVFASF TATLPTTIPA GGHIYLSASP SSLQPTLVDD RIIIRDGANI IFQHTYDLES NGELVEIPWE VINAASGHSL TITFVDVSAG LVGATPIYLI WVAE
|
| |
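The gene and protein sequences above are shown in space-separated blocks of ten. A minimal sketch of how the two relate: strip the whitespace, then translate codon by codon. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons, so the sketch uses the standard codon-to-residue mapping and does not handle alternative start codons (this gene starts with ATG, so that case does not arise here). The helper names `clean`, `gc_percent`, and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three and last two blocks of the gene sequence in this record.
print(translate("ATGAGTACCC CAAAACCTAT CGTTTATGCC"))  # MSTPKPIVYA
print(translate("TGGGTGGCTG AGTAG"))                  # WVAE
```

The two fragments reproduce the first ten and last four residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content row.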