Gene Information |
Locus tag | TK90_2795 |
Symbol | |
ID | 8829207 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013930 |
Strand | + |
Start bp | 149017 |
End bp | 150744 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | |
Product | Procollagen-proline dioxygenase |
Protein accession | YP_003494747 |
Protein GI | 290243077 |
COG category | |
COG ID | |
TIGRFAM ID | |
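The length fields in this record can be cross-checked from the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS length includes the stop codon. The record does not state either convention, but both are consistent with the values listed. The function names are illustrative and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 149017
END_BP = 150744


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


GENE_LEN = gene_length(START_BP, END_BP)
PROTEIN_LEN = protein_length(GENE_LEN)
print(GENE_LEN, PROTEIN_LEN)
```

Under these assumptions the results match the Gene Length (1728 bp) and Protein Length (575 aa) fields.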
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 66 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGAATG GCGAGGTAAG GATCATTGAG GGCGAGATTC CCGATTCAGT TCGGGAGGCC GTAACGAGCT CGCTCCCGGT GGTGGCCGAG GAAGCCGAGC CGGAGAGGGT CGAACGAAAC CGGATGCCGG CGGAACGATA CGACGGGATG GAGACGCTAT CGCAGGACCC GCTGGTGGTC TATCTCGATG AGTTTCTGGA GCCGGGGGAG TGTGAAGCGC TCATTCATCT GGCGCAGGGC CGCATGAAGC GTGCGCTGGT GTCGCTCGAT GGGAGTAGCG GCGTGAGTCA GGGCCGGACG GGCTCCAACT GCTGGCTGCG CTATCAGGAA GAGCCGCTGG CGCGCCGCAT CGGAGAGCGG GTCGCAAAAC GGGTCGGATT CCCCTTGGAA TACGCCGAGC CGCTTCAGGT TATTCACTAC GGTCACGAGC AGGAATACCG ACCTCATTAC GATGCATACG ATCTGGATAC GCCGCGGGGA CTGAGGTGTA CACGGCAGGG CGGACAGCGA ATGGTTACCG CATTGCTGTA CCTCAATGAG GTCGAGGAAG GGGGTGCTAC GGCATTTCCA AACGCCGGAG TCGAAGTCGC GCCCCGCAAG GGGAGGATCG CTATATTCAA TAATGTCGGT GCTGACCCGG GTCGCCCACA TCCACGCAGT TTGCATGGCG GCATGCCGGT GAAGAGCGGC GAAAAGTGGG CGGCCAGCAT CTGGTTCCGC GCCCGCCCGG CGCACGAACG GCAACCCTGG TTCGATGACG TGGAGGACGC CAGCGCCCAG GTCCCCGAGG GCGAGGGTGG TCACTGGCCG GTAGTGGCGA GCAATCGGGC GCAAAGCATC CTTCAGCCTG CGCTGGAGAG GGCGGCGCCG ATGCTGCCCC CGGAAGCTGG CGACGTAATG GTCGAATACT GTGTCGGCCC GGACAATCAG CGCGAGGAAA GTTCTGAAGC GGAGGCCTTC GGGCTTGTGG TGCGGGCGAT GCCTTCGAGC ATCACCAATG AAGCCGAGAA CAAGAAGAAC GTCGTGCGCA AAATGAAGGA GGCCGGGCAC GCGGAGCGCA TCCCGCTCTC TTGTGATTCG ATCAGCGACG CGATGGGGCT TCCGGGGGCT CGCGACGCCG TGTGGTTTGT CCGTCCATCG TTTGGGAGTG CGGGACGGGG TACGCATTGC GTGCGGGGGG CGAGCCTGCG GGGGGCGAGC CTGCATCCCC AGCAGTTCTT GCAGCTGGCG GAGGAAAGTC TACTTCTCAT CAGAGGGCGA AAGTTTCTCA CGCGTGCATT CGTTCTGGTC TGGGGCGGGG CGGCCTACCT GTTTGATGAG GGGTATGTGC TGATGCATGG GGCTCAGTAC CAGGTCGGGA GCACCAATGC GGCCACGCAG ATGGATCACC GCAATGCGCA TGACCCGTCA GGTCCGCTGG TCCAGGAAGT GTTCCATGAG GTAGCGCAGC TGAAGGACTC TCACTGGGAG GATCTGTCGG CTGCGGTTAC GGCCGTAGTG GAGGCATTCC CGGGGCTGGC CGAAAATTCA TCGGCGACCA CGTTTGCGGT GCTCGGGGTG GATGCGCTGT TCCGCGAGAA TGGGCACGCG CTGATCCTCG ATATCAGCAC AATGCCGAAT TTCGTGCAGC AGCCAGCAAT CAACGACCGG GTCACGATCC CGTTGTGGGT ATCGATTTTC GAGATGCTGG CGGGCACGGG AAGCCAGCGA TTTAAACGCA TCACTTGA
|
Protein sequence | MVNGEVRIIE GEIPDSVREA VTSSLPVVAE EAEPERVERN RMPAERYDGM ETLSQDPLVV YLDEFLEPGE CEALIHLAQG RMKRALVSLD GSSGVSQGRT GSNCWLRYQE EPLARRIGER VAKRVGFPLE YAEPLQVIHY GHEQEYRPHY DAYDLDTPRG LRCTRQGGQR MVTALLYLNE VEEGGATAFP NAGVEVAPRK GRIAIFNNVG ADPGRPHPRS LHGGMPVKSG EKWAASIWFR ARPAHERQPW FDDVEDASAQ VPEGEGGHWP VVASNRAQSI LQPALERAAP MLPPEAGDVM VEYCVGPDNQ REESSEAEAF GLVVRAMPSS ITNEAENKKN VVRKMKEAGH AERIPLSCDS ISDAMGLPGA RDAVWFVRPS FGSAGRGTHC VRGASLRGAS LHPQQFLQLA EESLLLIRGR KFLTRAFVLV WGGAAYLFDE GYVLMHGAQY QVGSTNAATQ MDHRNAHDPS GPLVQEVFHE VAQLKDSHWE DLSAAVTAVV EAFPGLAENS SATTFAVLGV DALFRENGHA LILDISTMPN FVQQPAINDR VTIPLWVSIF EMLAGTGSQR FKRIT
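The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch of how such a display could be joined back into a contiguous string and checked against the GC content field follows. The helper names are hypothetical, and the short sample is only the first two blocks of the gene sequence, not the full record.

```python
def join_blocks(blocked: str) -> str:
    """Remove the whitespace between display blocks and uppercase the result."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, used here only as a small sample.
sample = join_blocks("ATGGTGAATG GCGAGGTAAG")
print(len(sample), gc_percent(sample))
```

Applying `gc_percent` to the full joined gene sequence gives a value to compare with the GC content field of the record.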