Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acry_1166 |
Symbol | |
ID | 5161714 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidiphilium cryptum JF-5 |
Kingdom | Bacteria |
Replicon accession | NC_009484 |
Strand | - |
Start bp | 1299633 |
End bp | 1302785 |
Gene Length | 3153 bp |
Protein Length | 1050 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640553080 |
Product | triple helix repeat-containing collagen |
Protein accession | YP_001234297 |
Protein GI | 148260170 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000520628 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAGTT CGATTACTGT AAGTCCCGCG ACGGGAAACT ACGGCGATGT TGTTTTGAGC GGAAATTATC AAGATTGTAA CACGCCTCTT GTGGCCGTAC TGGTTACGTC GAACGGCACT TATACTCAAA TCGGTCTCAT TTATGACTTC AATGATAGCA CAAATGATGA TGACACGTCG AATAATGGTG GTGGAGATCC ATTTACTTTA ACTGTTTCTC TGCCTACCGG AACATATTCC GTATATTCGA ATAGTTCTGG AAAAGATATC AGTGACGTTC CGATAAGCTC GGGCAGTCAA GGCACGATCG TTTTTATTAA TCCGTCTTCT CCTGACCAAG TCACAAGCTT GGTGACGTCT CTGTCTATTT CATCCGTAGG ATCGCTGGGT CCGACGGGCG CTACGGGTGC CGCCGGCGCA ACTGGAGCCA CGGGTGCCAC CGGCGCGACG GGTGCCACGG GCGCGACGGG TGCCACCGGC GCGATGGGTG CCACCGGCGC AACGGGTGCG ACAGGCGCCA CGGGCGCAAC GGGTGCCACC GGCGCGATGG GTGCCACCGG CGCAACGGGC GCGATGGGTG CCACGGGTGC GACGGGTGCC ACAGGCGCCA CGGGCGCGAC GGGCGCCACC GGCGCGACGG GTGCCACGGG CGCCACCGGC GCAACTGGAG CCACGGGCGC CACGGGCGCG ATGGGTGCCA CGGGTGCGAC GGGTGCCACA GGCGCCACGG GCGCGACGGG TGCCACCGGC GCGACGGGTG CCACGGGCGC CACCGGCGCG ACGGGTGCCA CGGGCGCGAC GGGTGCCACC GGCGCAACGG GCGCAACCGG AGCCACCGGC GTGACGGGTG CCACCGGCGC AACGGGTGCC ACCGGCGCGA CGGGCGCGAT GGGTGCCACG GGCGCAACGG GCGCGATGGG TGCCACCGGC GCAACGGGCG CGATGGGTGC CACGGGTGCG ACGGGTGCCA CAGGCGCCAC GGGCGCGACG GGCGCCACCG GCGCGACGGG TGCCACGGGC GCCACCGGCG CAACTGGAGC CACGGGCGCC ACGGGCGCGA TGGGCGCCAC GGGCGCGATG GGTGCCACGG GTGCGACGGG TGCCACAGGC GCCACGGGCG CGACGGGTGC CACCGGCGCG ACGGGTGCCA CGGGCGCCAC CGGCGCGACG GGTGCCACGG GCGCGACGGG TGCCACCGGC GCAACGGGCG CAACCGGAGC CACCGGCGTG ACGGGTGCCA CGGGCGCGAT GGGTGCGACG GGTGCCACGG GCGCAACGGG TGCCACCGGC GCAACGGGCG CGATGGGTGC CACGGGTGCG ACGGGTGCCA CGGGTGCGAC GGGTGCCACC GGCGCGATGG GTGCCACAGG CGCCACGGGC GCGACGGGTG CCACGGGCGC CACCGGCGCG ATGGGTGCCA CCGGCGCAAC GGGCGCGATG GGTGCCACGG GTGCGACGGG TGCCACCGGA GCCACGGGCG CAACCGGAGC CACCGGAGCC ACCGGCGCGA CGGGTGCCAC AGGCGCCACG GGCGCGACGG GTGCCACGGG CGCAACGGGT GCGATGGGTG CCACGGGCGC GACGGGTGCG ACGGGTGCCA CCGGCGCAAC GGGTGCCACC GGCGCAACGG GCGCGATGGG TGCGACCGGA GCCACGGGCG CAACGGGCGC GACGGGTGCC ACGGGCGCGA CGGGTGCCAC GGGCGCAACG GGTGCGACGG GTGCCACCGG CGCAACGGGT GCGACGGGTG CCACCGGCGC AACGGGTGCC ACCGGCGCCA CGGGCGCAAC CGGAGCCACC GGCGTGACGG GTGCCACGGG CGCGATGGGT GCCACCGGAG CCACGGGCGC AACCGGAGCC ACCGGAGCCA CCGGCGCGAC GGGTGCCACC GGAGCCACGG GCGCAACCGG AGCCACCGGA GCCACCGGCG CGACGGGTGC CACGGGCGCC ACCGGCGCGA TGGGTGCCAC GGGCGCGACG GGTGCCACCG GCGCAACGGG TGCCACGGGC GCAACCGGAG CCACCGGCGC GACGGGTGCG ACGGGTGCCA CGGGCGCGAT GGGTGCCACC GGCGCGACGG GTGCGACGGG TGCCACGGGC GCGATGGGTG CCACCGGCGC GACGGGTGCC ACGGGCGCCA CCGGCGCGAT GGGTGCCACG GGCGCGACGG GTGCCACCGG CGCAACGGGT GCCACGGGCG CAACCGGAGC CACCGGCGCG ACGGGTGCCA CGGGTGCGAC GGGTGGAAAT CCCTGCTTCG CAGAGGGGAC GCGGATTGCG ACTGTGCGGG GTGACGTGCC GGTAGAGGAG TTGGTCGCAG GCGATGTGGT CGTGCTGCAC GACGGCGGAA CGGCGCCGGT GGTGTGGCTC GGCTATCGCA CGATCGACCT CGACCGCCAC GCCAGACCCG AGGCGGTGCA GCCGATCGTG ATCGACGCCG GCGCCATTGC CGACGGCATC CCGGTCCGCG ACCTGATCGT CTCGCCGGAT CACGCTTTCT ACCTCGACGG CGTGCTCATC CCGGCAAAGG CGCTGGTTAA CGGCGCGACG ATCCGGCAGC TCCGGCGCAG CCAGGTCACC TACTTCCATG TCGAGCTGCC GCAGCATGCG GTACTTCTGG CGGAAGGCAT GGCCGCGGAG AGCTACCTCG AGACCGGCAA CCGCCCGGCC TTCGAGAATG GCGGTGACGC GATCATCCTG CATCCGGACT TCGCGCAGGC CCTGCGCGAG ACCGGAAGCT GCGCGCCCTT CGCCGAGGAA GGCGCGATCG TGGAACGGGT CCGGGCACGC ATCTTGGCCC GGGCCGGGAT CGAGACCACG GACGATGCGG ATCTCAAGAT CCGGTATCGG GCCGATGGCG CGGCGGTGAT CACCTCGCGC ACGGCGATCC CGGGCTACCT GACCCCGGAT CCGCGTGACC GCCGCGTGCT CGGTGTCAAG ATCGGCGCGA TGATGCTTGG CGACGAGCCG ATCGCGCTCG ATCACCCGGC ATTGACCGAG GGCTGGCATG ACGTCGAGGC CGACGGCCGC TGGACTGGCG GTGCGGCGGT GGTGCCGGCC AGCCTGATCA ACGGCCGCAC GCTCACGATC ACCGTGGTCG GCACGCTCGC CTACCCGGCC CACGGCGCAA ACCGCAGCGT CGAAGCCGGC TGA
|
Protein sequence | MSSSITVSPA TGNYGDVVLS GNYQDCNTPL VAVLVTSNGT YTQIGLIYDF NDSTNDDDTS NNGGGDPFTL TVSLPTGTYS VYSNSSGKDI SDVPISSGSQ GTIVFINPSS PDQVTSLVTS LSISSVGSLG PTGATGAAGA TGATGATGAT GATGATGATG AMGATGATGA TGATGATGAT GAMGATGATG AMGATGATGA TGATGATGAT GATGATGATG ATGATGATGA MGATGATGAT GATGATGATG ATGATGATGA TGATGATGAT GATGATGATG VTGATGATGA TGATGAMGAT GATGAMGATG ATGAMGATGA TGATGATGAT GATGATGATG ATGATGATGA TGAMGATGAM GATGATGATG ATGATGATGA TGATGATGAT GATGATGATG ATGATGATGV TGATGAMGAT GATGATGATG ATGAMGATGA TGATGATGAT GAMGATGATG ATGATGATGA MGATGATGAM GATGATGATG ATGATGATGA TGATGATGAT GATGATGATG AMGATGATGA TGATGATGAT GATGAMGATG ATGATGATGA TGATGATGAT GATGATGATG ATGATGATGA TGATGATGAT GVTGATGAMG ATGATGATGA TGATGATGAT GATGATGATG ATGATGATGA TGAMGATGAT GATGATGATG ATGATGATGA TGATGAMGAT GATGATGATG AMGATGATGA TGATGAMGAT GATGATGATG ATGATGATGA TGATGATGGN PCFAEGTRIA TVRGDVPVEE LVAGDVVVLH DGGTAPVVWL GYRTIDLDRH ARPEAVQPIV IDAGAIADGI PVRDLIVSPD HAFYLDGVLI PAKALVNGAT IRQLRRSQVT YFHVELPQHA VLLAEGMAAE SYLETGNRPA FENGGDAIIL HPDFAQALRE TGSCAPFAEE GAIVERVRAR ILARAGIETT DDADLKIRYR ADGAAVITSR TAIPGYLTPD PRDRRVLGVK IGAMMLGDEP IALDHPALTE GWHDVEADGR WTGGAAVVPA SLINGRTLTI TVVGTLAYPA HGANRSVEAG
|
| |