Gene Acry_1166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcry_1166 
Symbol 
ID5161714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidiphilium cryptum JF-5 
KingdomBacteria 
Replicon accessionNC_009484 
Strand
Start bp1299633 
End bp1302785 
Gene Length3153 bp 
Protein Length1050 aa 
Translation table11 
GC content72% 
IMG OID640553080 
Producttriple helix repeat-containing collagen 
Protein accessionYP_001234297 
Protein GI148260170 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000520628 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAAGTT CGATTACTGT AAGTCCCGCG ACGGGAAACT ACGGCGATGT TGTTTTGAGC 
GGAAATTATC AAGATTGTAA CACGCCTCTT GTGGCCGTAC TGGTTACGTC GAACGGCACT
TATACTCAAA TCGGTCTCAT TTATGACTTC AATGATAGCA CAAATGATGA TGACACGTCG
AATAATGGTG GTGGAGATCC ATTTACTTTA ACTGTTTCTC TGCCTACCGG AACATATTCC
GTATATTCGA ATAGTTCTGG AAAAGATATC AGTGACGTTC CGATAAGCTC GGGCAGTCAA
GGCACGATCG TTTTTATTAA TCCGTCTTCT CCTGACCAAG TCACAAGCTT GGTGACGTCT
CTGTCTATTT CATCCGTAGG ATCGCTGGGT CCGACGGGCG CTACGGGTGC CGCCGGCGCA
ACTGGAGCCA CGGGTGCCAC CGGCGCGACG GGTGCCACGG GCGCGACGGG TGCCACCGGC
GCGATGGGTG CCACCGGCGC AACGGGTGCG ACAGGCGCCA CGGGCGCAAC GGGTGCCACC
GGCGCGATGG GTGCCACCGG CGCAACGGGC GCGATGGGTG CCACGGGTGC GACGGGTGCC
ACAGGCGCCA CGGGCGCGAC GGGCGCCACC GGCGCGACGG GTGCCACGGG CGCCACCGGC
GCAACTGGAG CCACGGGCGC CACGGGCGCG ATGGGTGCCA CGGGTGCGAC GGGTGCCACA
GGCGCCACGG GCGCGACGGG TGCCACCGGC GCGACGGGTG CCACGGGCGC CACCGGCGCG
ACGGGTGCCA CGGGCGCGAC GGGTGCCACC GGCGCAACGG GCGCAACCGG AGCCACCGGC
GTGACGGGTG CCACCGGCGC AACGGGTGCC ACCGGCGCGA CGGGCGCGAT GGGTGCCACG
GGCGCAACGG GCGCGATGGG TGCCACCGGC GCAACGGGCG CGATGGGTGC CACGGGTGCG
ACGGGTGCCA CAGGCGCCAC GGGCGCGACG GGCGCCACCG GCGCGACGGG TGCCACGGGC
GCCACCGGCG CAACTGGAGC CACGGGCGCC ACGGGCGCGA TGGGCGCCAC GGGCGCGATG
GGTGCCACGG GTGCGACGGG TGCCACAGGC GCCACGGGCG CGACGGGTGC CACCGGCGCG
ACGGGTGCCA CGGGCGCCAC CGGCGCGACG GGTGCCACGG GCGCGACGGG TGCCACCGGC
GCAACGGGCG CAACCGGAGC CACCGGCGTG ACGGGTGCCA CGGGCGCGAT GGGTGCGACG
GGTGCCACGG GCGCAACGGG TGCCACCGGC GCAACGGGCG CGATGGGTGC CACGGGTGCG
ACGGGTGCCA CGGGTGCGAC GGGTGCCACC GGCGCGATGG GTGCCACAGG CGCCACGGGC
GCGACGGGTG CCACGGGCGC CACCGGCGCG ATGGGTGCCA CCGGCGCAAC GGGCGCGATG
GGTGCCACGG GTGCGACGGG TGCCACCGGA GCCACGGGCG CAACCGGAGC CACCGGAGCC
ACCGGCGCGA CGGGTGCCAC AGGCGCCACG GGCGCGACGG GTGCCACGGG CGCAACGGGT
GCGATGGGTG CCACGGGCGC GACGGGTGCG ACGGGTGCCA CCGGCGCAAC GGGTGCCACC
GGCGCAACGG GCGCGATGGG TGCGACCGGA GCCACGGGCG CAACGGGCGC GACGGGTGCC
ACGGGCGCGA CGGGTGCCAC GGGCGCAACG GGTGCGACGG GTGCCACCGG CGCAACGGGT
GCGACGGGTG CCACCGGCGC AACGGGTGCC ACCGGCGCCA CGGGCGCAAC CGGAGCCACC
GGCGTGACGG GTGCCACGGG CGCGATGGGT GCCACCGGAG CCACGGGCGC AACCGGAGCC
ACCGGAGCCA CCGGCGCGAC GGGTGCCACC GGAGCCACGG GCGCAACCGG AGCCACCGGA
GCCACCGGCG CGACGGGTGC CACGGGCGCC ACCGGCGCGA TGGGTGCCAC GGGCGCGACG
GGTGCCACCG GCGCAACGGG TGCCACGGGC GCAACCGGAG CCACCGGCGC GACGGGTGCG
ACGGGTGCCA CGGGCGCGAT GGGTGCCACC GGCGCGACGG GTGCGACGGG TGCCACGGGC
GCGATGGGTG CCACCGGCGC GACGGGTGCC ACGGGCGCCA CCGGCGCGAT GGGTGCCACG
GGCGCGACGG GTGCCACCGG CGCAACGGGT GCCACGGGCG CAACCGGAGC CACCGGCGCG
ACGGGTGCCA CGGGTGCGAC GGGTGGAAAT CCCTGCTTCG CAGAGGGGAC GCGGATTGCG
ACTGTGCGGG GTGACGTGCC GGTAGAGGAG TTGGTCGCAG GCGATGTGGT CGTGCTGCAC
GACGGCGGAA CGGCGCCGGT GGTGTGGCTC GGCTATCGCA CGATCGACCT CGACCGCCAC
GCCAGACCCG AGGCGGTGCA GCCGATCGTG ATCGACGCCG GCGCCATTGC CGACGGCATC
CCGGTCCGCG ACCTGATCGT CTCGCCGGAT CACGCTTTCT ACCTCGACGG CGTGCTCATC
CCGGCAAAGG CGCTGGTTAA CGGCGCGACG ATCCGGCAGC TCCGGCGCAG CCAGGTCACC
TACTTCCATG TCGAGCTGCC GCAGCATGCG GTACTTCTGG CGGAAGGCAT GGCCGCGGAG
AGCTACCTCG AGACCGGCAA CCGCCCGGCC TTCGAGAATG GCGGTGACGC GATCATCCTG
CATCCGGACT TCGCGCAGGC CCTGCGCGAG ACCGGAAGCT GCGCGCCCTT CGCCGAGGAA
GGCGCGATCG TGGAACGGGT CCGGGCACGC ATCTTGGCCC GGGCCGGGAT CGAGACCACG
GACGATGCGG ATCTCAAGAT CCGGTATCGG GCCGATGGCG CGGCGGTGAT CACCTCGCGC
ACGGCGATCC CGGGCTACCT GACCCCGGAT CCGCGTGACC GCCGCGTGCT CGGTGTCAAG
ATCGGCGCGA TGATGCTTGG CGACGAGCCG ATCGCGCTCG ATCACCCGGC ATTGACCGAG
GGCTGGCATG ACGTCGAGGC CGACGGCCGC TGGACTGGCG GTGCGGCGGT GGTGCCGGCC
AGCCTGATCA ACGGCCGCAC GCTCACGATC ACCGTGGTCG GCACGCTCGC CTACCCGGCC
CACGGCGCAA ACCGCAGCGT CGAAGCCGGC TGA
 
Protein sequence
MSSSITVSPA TGNYGDVVLS GNYQDCNTPL VAVLVTSNGT YTQIGLIYDF NDSTNDDDTS 
NNGGGDPFTL TVSLPTGTYS VYSNSSGKDI SDVPISSGSQ GTIVFINPSS PDQVTSLVTS
LSISSVGSLG PTGATGAAGA TGATGATGAT GATGATGATG AMGATGATGA TGATGATGAT
GAMGATGATG AMGATGATGA TGATGATGAT GATGATGATG ATGATGATGA MGATGATGAT
GATGATGATG ATGATGATGA TGATGATGAT GATGATGATG VTGATGATGA TGATGAMGAT
GATGAMGATG ATGAMGATGA TGATGATGAT GATGATGATG ATGATGATGA TGAMGATGAM
GATGATGATG ATGATGATGA TGATGATGAT GATGATGATG ATGATGATGV TGATGAMGAT
GATGATGATG ATGAMGATGA TGATGATGAT GAMGATGATG ATGATGATGA MGATGATGAM
GATGATGATG ATGATGATGA TGATGATGAT GATGATGATG AMGATGATGA TGATGATGAT
GATGAMGATG ATGATGATGA TGATGATGAT GATGATGATG ATGATGATGA TGATGATGAT
GVTGATGAMG ATGATGATGA TGATGATGAT GATGATGATG ATGATGATGA TGAMGATGAT
GATGATGATG ATGATGATGA TGATGAMGAT GATGATGATG AMGATGATGA TGATGAMGAT
GATGATGATG ATGATGATGA TGATGATGGN PCFAEGTRIA TVRGDVPVEE LVAGDVVVLH
DGGTAPVVWL GYRTIDLDRH ARPEAVQPIV IDAGAIADGI PVRDLIVSPD HAFYLDGVLI
PAKALVNGAT IRQLRRSQVT YFHVELPQHA VLLAEGMAAE SYLETGNRPA FENGGDAIIL
HPDFAQALRE TGSCAPFAEE GAIVERVRAR ILARAGIETT DDADLKIRYR ADGAAVITSR
TAIPGYLTPD PRDRRVLGVK IGAMMLGDEP IALDHPALTE GWHDVEADGR WTGGAAVVPA
SLINGRTLTI TVVGTLAYPA HGANRSVEAG