Gene TK90_2795 details


Gene Information

Locus tag: TK90_2795
Symbol:
ID: 8829207
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thioalkalivibrio sp. K90mix
Kingdom: Bacteria
Replicon accession: NC_013930
Strand:
Start bp: 149017
End bp: 150744
Gene Length: 1728 bp
Protein Length: 575 aa
Translation table: 11
GC content: 62%
IMG OID:
Product: Procollagen-proline dioxygenase
Protein accession: YP_003494747
Protein GI: 290243077
COG category:
COG ID:
TIGRFAM ID:
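
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon (the gene sequence in this record ends in TGA). The function names `span_length` and `coding_codons` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 149017
END_BP = 150744
GENE_LENGTH_BP = 1728
PROTEIN_LENGTH_AA = 575


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons in a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks compare a derived value against the value listed in the record.
    print("Gene length consistent:", span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("Protein length consistent:", coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the listed gene length (1728 bp) matches the coordinate span, and 1728 / 3 = 576 codons less one stop codon gives the listed 575 aa.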


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 66
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGAATG GCGAGGTAAG GATCATTGAG GGCGAGATTC CCGATTCAGT TCGGGAGGCC 
GTAACGAGCT CGCTCCCGGT GGTGGCCGAG GAAGCCGAGC CGGAGAGGGT CGAACGAAAC
CGGATGCCGG CGGAACGATA CGACGGGATG GAGACGCTAT CGCAGGACCC GCTGGTGGTC
TATCTCGATG AGTTTCTGGA GCCGGGGGAG TGTGAAGCGC TCATTCATCT GGCGCAGGGC
CGCATGAAGC GTGCGCTGGT GTCGCTCGAT GGGAGTAGCG GCGTGAGTCA GGGCCGGACG
GGCTCCAACT GCTGGCTGCG CTATCAGGAA GAGCCGCTGG CGCGCCGCAT CGGAGAGCGG
GTCGCAAAAC GGGTCGGATT CCCCTTGGAA TACGCCGAGC CGCTTCAGGT TATTCACTAC
GGTCACGAGC AGGAATACCG ACCTCATTAC GATGCATACG ATCTGGATAC GCCGCGGGGA
CTGAGGTGTA CACGGCAGGG CGGACAGCGA ATGGTTACCG CATTGCTGTA CCTCAATGAG
GTCGAGGAAG GGGGTGCTAC GGCATTTCCA AACGCCGGAG TCGAAGTCGC GCCCCGCAAG
GGGAGGATCG CTATATTCAA TAATGTCGGT GCTGACCCGG GTCGCCCACA TCCACGCAGT
TTGCATGGCG GCATGCCGGT GAAGAGCGGC GAAAAGTGGG CGGCCAGCAT CTGGTTCCGC
GCCCGCCCGG CGCACGAACG GCAACCCTGG TTCGATGACG TGGAGGACGC CAGCGCCCAG
GTCCCCGAGG GCGAGGGTGG TCACTGGCCG GTAGTGGCGA GCAATCGGGC GCAAAGCATC
CTTCAGCCTG CGCTGGAGAG GGCGGCGCCG ATGCTGCCCC CGGAAGCTGG CGACGTAATG
GTCGAATACT GTGTCGGCCC GGACAATCAG CGCGAGGAAA GTTCTGAAGC GGAGGCCTTC
GGGCTTGTGG TGCGGGCGAT GCCTTCGAGC ATCACCAATG AAGCCGAGAA CAAGAAGAAC
GTCGTGCGCA AAATGAAGGA GGCCGGGCAC GCGGAGCGCA TCCCGCTCTC TTGTGATTCG
ATCAGCGACG CGATGGGGCT TCCGGGGGCT CGCGACGCCG TGTGGTTTGT CCGTCCATCG
TTTGGGAGTG CGGGACGGGG TACGCATTGC GTGCGGGGGG CGAGCCTGCG GGGGGCGAGC
CTGCATCCCC AGCAGTTCTT GCAGCTGGCG GAGGAAAGTC TACTTCTCAT CAGAGGGCGA
AAGTTTCTCA CGCGTGCATT CGTTCTGGTC TGGGGCGGGG CGGCCTACCT GTTTGATGAG
GGGTATGTGC TGATGCATGG GGCTCAGTAC CAGGTCGGGA GCACCAATGC GGCCACGCAG
ATGGATCACC GCAATGCGCA TGACCCGTCA GGTCCGCTGG TCCAGGAAGT GTTCCATGAG
GTAGCGCAGC TGAAGGACTC TCACTGGGAG GATCTGTCGG CTGCGGTTAC GGCCGTAGTG
GAGGCATTCC CGGGGCTGGC CGAAAATTCA TCGGCGACCA CGTTTGCGGT GCTCGGGGTG
GATGCGCTGT TCCGCGAGAA TGGGCACGCG CTGATCCTCG ATATCAGCAC AATGCCGAAT
TTCGTGCAGC AGCCAGCAAT CAACGACCGG GTCACGATCC CGTTGTGGGT ATCGATTTTC
GAGATGCTGG CGGGCACGGG AAGCCAGCGA TTTAAACGCA TCACTTGA
 
Protein sequence
MVNGEVRIIE GEIPDSVREA VTSSLPVVAE EAEPERVERN RMPAERYDGM ETLSQDPLVV 
YLDEFLEPGE CEALIHLAQG RMKRALVSLD GSSGVSQGRT GSNCWLRYQE EPLARRIGER
VAKRVGFPLE YAEPLQVIHY GHEQEYRPHY DAYDLDTPRG LRCTRQGGQR MVTALLYLNE
VEEGGATAFP NAGVEVAPRK GRIAIFNNVG ADPGRPHPRS LHGGMPVKSG EKWAASIWFR
ARPAHERQPW FDDVEDASAQ VPEGEGGHWP VVASNRAQSI LQPALERAAP MLPPEAGDVM
VEYCVGPDNQ REESSEAEAF GLVVRAMPSS ITNEAENKKN VVRKMKEAGH AERIPLSCDS
ISDAMGLPGA RDAVWFVRPS FGSAGRGTHC VRGASLRGAS LHPQQFLQLA EESLLLIRGR
KFLTRAFVLV WGGAAYLFDE GYVLMHGAQY QVGSTNAATQ MDHRNAHDPS GPLVQEVFHE
VAQLKDSHWE DLSAAVTAVV EAFPGLAENS SATTFAVLGV DALFRENGHA LILDISTMPN
FVQQPAINDR VTIPLWVSIF EMLAGTGSQR FKRIT