Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0413 |
Symbol | |
ID | 8413262 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | + |
Start bp | 476774 |
End bp | 478741 |
Gene Length | 1968 bp |
Protein Length | 655 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 645021981 |
Product | Glucan-binding protein C |
Protein accession | YP_003179435 |
Protein GI | 257784218 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGGG TCTCAACCTC TTCTTTTACG AGAGTAGCTG CAGTGTCATT ATCTCTTCTT TTGATATTGA CTCTTTTTCC TCTTAATCCT GTAAAAGCTT ATGCAACAGA TCCTGCAAAA GGTATAGAGG TTCGTGAAGA TGCTTCTAAA TTAGAGGCGA GTGTTGCAGC AGCAAAGGCA GCTGGTGTAG ATGTTCAAAA AGATGATGAT GTTGACAAGG GTTCTGTCGA ATCGTCATCA GATATTGATG CTAAAAAGTC TGAGATTCAA GATGATTACA ACAAACAATC TCAAGATTTA GATGCTATTA CAGAAGAAGC TAAACAGAAG CTAGCAGACT ATGCTACAAA AAAGGCTGCT TATGATACCG CCAAGGCTAA GTATGACGCA GACAAAGCTC AGTATGATGC AGACAAAGCT CAGTATGATG CAGATATGAT TGCTTATAAT AAAGCTATGG CTGAGCTTGA ACAGAAAAAG AACGAAGATG GCTATATGAC TAAGCCATAT CCGCAGCTTT TAACATTTAA ATCTGAGCCA AATGCAGTAC TTACTCTTTC AGGCAGAAAA TATACACATG ATGAGTTTAG TGCAGAAGTT AGATCTTGGA ATCTTGGATC TGAGCCATGG AGATATTCAT ACTTTGATGC ATTAAATAAT GGACAGGCTG CTAATGCAGC ACGTGTTATG TTAGAAAAAG ATAAGCCTTT CACTGCTACA TATACAAATT TGACTAATTC TAGCTATAAC GGTAAGAAGA TTTCTAAAGT TGTATATACC TATACGTATA AGGGTTCTTC GGGAGTAAAC GTACCTAATA AGCTTCCAGT TGTCTTGCAA AAGGACCCTA CGGTTACTAT TTGGTATAAC GATTTCTTCG GTGATGCTCG AATTAATGTA ACTGTTAAAT TTTATGATGA AGACGGTAAT GTAATTGACC CAACTGGTTC ATTACTGAGT TTTTCTTCAT TAAATAGAGG AAATGGATCC GGTGCAGTTG ATAAAGATGC AATTGAAAAG GTTGGATACT TTAACGGCGA ATATGTGCCT ATTTCAGGAT CAACAATTAA ACCCCATGCT GATGGCAGCG CGTATTCAGA TACAAATAAT GCTGAAAAAG CTTATGGTTC CAGATTTAAT ACAGCTGACT GGGATACACC AACTTCTCCT AAAGCATGGT ATGGTGCCAT TGTTGGCCGA GTAACAAGTC CTGAAATTAG TTTTGATATG GCCTCTCATA AGAGCGGCAT TGTTTGGTTT GCTCTCAATT CGGATATTAA GGCAATTAAT GTGCCCACCA AACCAGTTGA GCCAACACCA CCTACACCTC CAGCAGAAGA GCCTGAGAAG CCAACATTTA GCGCTCGATA TCATTTGGAC GTATTCTATG TAAAGCCTCA GTTAGAGAAG AAAGCATTAA GTGAGGATGA TAAGGATATT AATTCCAATA CAGTAAAGAC TAATTCTGTT GTCAAATTTG CATTGAATAC TACGCCTTTT CCAGCAGGGC ATGAAAAAAT TGACTCTGTA GTTTTCCATG ATGTATTACC CGAGGGTTAT GAAGTCAATT TGGAAGACAC AAAGAAGGCA AGTCCTGATT ATGAAGTAAG TTACGATGAA GGTACGCGTA CGCTTGTCTT TACAGCAAAT GCTTCTTTGC TCAGTCAGAT TAATGCTGAT CCAACTAAGG AAGCTGATGT TCCTGCTCCT GTAATCGTTG GTAAAGTTAC AAAAGATGGA GCTGTTTACG AAAACGACTT TGATATAGAT ATTAATAATA CCTATACAAG AAGCTCAAAT AAGGTAACGG TAAAGACACC TGAGCCTCCT AATCCGCCAA AGAAACGTAA AAAGAAGGCT AAAACTCCAT ATACAGGAGA CGCAGGAGTA TTTTCTTCTA TTGCTCTTTG TACAGGATCA ATAGCAGTAT TAGGTGGTTC TTGGTTTATT AAGAAGAAAA AGAAATAG
|
Protein sequence | MKRVSTSSFT RVAAVSLSLL LILTLFPLNP VKAYATDPAK GIEVREDASK LEASVAAAKA AGVDVQKDDD VDKGSVESSS DIDAKKSEIQ DDYNKQSQDL DAITEEAKQK LADYATKKAA YDTAKAKYDA DKAQYDADKA QYDADMIAYN KAMAELEQKK NEDGYMTKPY PQLLTFKSEP NAVLTLSGRK YTHDEFSAEV RSWNLGSEPW RYSYFDALNN GQAANAARVM LEKDKPFTAT YTNLTNSSYN GKKISKVVYT YTYKGSSGVN VPNKLPVVLQ KDPTVTIWYN DFFGDARINV TVKFYDEDGN VIDPTGSLLS FSSLNRGNGS GAVDKDAIEK VGYFNGEYVP ISGSTIKPHA DGSAYSDTNN AEKAYGSRFN TADWDTPTSP KAWYGAIVGR VTSPEISFDM ASHKSGIVWF ALNSDIKAIN VPTKPVEPTP PTPPAEEPEK PTFSARYHLD VFYVKPQLEK KALSEDDKDI NSNTVKTNSV VKFALNTTPF PAGHEKIDSV VFHDVLPEGY EVNLEDTKKA SPDYEVSYDE GTRTLVFTAN ASLLSQINAD PTKEADVPAP VIVGKVTKDG AVYENDFDID INNTYTRSSN KVTVKTPEPP NPPKKRKKKA KTPYTGDAGV FSSIALCTGS IAVLGGSWFI KKKKK
|
| |