Gene Information |
Locus tag | TK90_1202 |
Symbol | |
ID | 8806964 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 1278371 |
End bp | 1280701 |
Gene Length | 2331 bp |
Protein Length | 776 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | DNA internalization-related competence protein ComEC/Rec2 |
Protein accession | YP_003460447 |
Protein GI | 289208381 |
COG category | |
COG ID | |
TIGRFAM ID | |
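The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the record does not state either convention explicitly. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the terminal stop codon encodes no residue


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1278371, 1280701)
print(length, protein_length_aa(length))  # prints "2331 776"
```

Under these assumptions the result agrees with the Gene Length (2331 bp) and Protein Length (776 aa) fields.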
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.177641 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGGTCG CGGTGCTGGG TCTGATCGCC GGTGTGTGGG GGTTGCACCA GCTCCCCGTC CTGCCTGTCT TGCTCGAAGG TCCATTGAGA GTGTTCCTGC CTGTCGCTGG CGCGCTGGTC GCACTTGGCT GGGTCCAGGC GGCTCGGCGA GGTTCTCCTT TCACGTGGTT GCCAGCAGCC TCGCTGGGGT TGCTCGCGGG CGTGTGGCTG GCACTCCCGC ATGCGACGGA CTGGTCGGGC AAGGTCGTGG GTCCCGAGTG GTCGGGAGTC GAGGTCTCGG TGCGGGCACA GGTGGTCAGT TTGCCGGAGC ACGGGGAGCG GCGCAGCCGG TTCCGGGTTG GGGTGTCGGA GATCGAGCAG GGCAGCCCGG GGCTGGAGCT TGAAGGCCGC GAGTTGTGGG TGAGCACCTT CCCCGCGCGG CCGGCGATAC AGGTGGGTGA TTCGCTGCGG CTGGACTTGC GATTGCGTGG TGTCGACGGC CCCCGCAACC CCGGGGGATT CGATGCCGCA GGCTGGTTCT ACCGCGAGGG GATGCACGGC AGCGCCGTTG CGCGTTCCGT TACCCCACTG GCAGGCGTGC ACGAGGGGGC CAGGGGCCGC GTGGTCATGC ACCGCCTTCG GGCCGCAATG CAGGCACGTC TGGAGCGCGC GGCACCGGAT TTGCGTCACC CGGGCCTGGT GCAGGCGCTG GTCATCGGCA ATCGTCAGGC CATGACCGAG AGTGAATGGC AGGCCTTTCT ACACACCGGG ACGAATCACC TGATGGCGAT TTCCGGGTTG CATGTGGCCC TGGTGGCGGG GTTTGCCGGC GGCATTGCCG GATGGTTGTG GTCCGGGGTG GCCGTTATAA GACGGCTGCG GCGCTGGCTG TTCATGGGTG TGGTCGGGCT GGTGGCCGCC ACGGGCTATG CGGTACTCGC CGGTTTCAGC ATCCCGACCC AAAGGGCTTT GCTGATGCTG ATTGCGCTCA CGCTTGCGGT GCTGTCGCGT CGCGAGGGTG TTGCCTGGCG CGCCCTGGCC CTTGCCGCGG CCATGGTGCT GATTGTGCAC CCACCGAGCG TGCTGGCCCC GGGGTTCTGG TTCTCGTTCG GCGCGGTGGC CGTGATCCTG CTGCTGTTGC AGGGTCGGGT CGGGACGACC GGCTGGCGCG AAGGCCTGCG CATCCAGGTC GTGCTGGCGA TTGCGATGCT GCCGCTCAGT CTGGCCTGGT TCCAGCTGGG CTCCTGGGTC GCACCCGTGG CCAACCTGGT CGCGGTGCCG ATGGTGACGC TATTGATCTT GCCGTTGTTG CTGGTCGGTG CCTTGCTCGC GTGGGGGTGG CCGGCCGCAG GTGGCTTGTT GTTAGGCATT GCGGATGGCC TGTTGGCACT GCTGGTGATG ACGCTCGAGG CACTGGCAGG GATCCCTGTC CTGGTGGACC AGCGCCATGT GCCGGTGGCC GCGTCCCTGC TGGGCGGGGT GGCTGTTTTC CTGGCCTTGC AGCCGCTCGC CCGGCGATTG GCGCCCTGGA TTGCAGTGGC TGCGATCGCC CTGGTCGCGC CGTTGCGCCC GGACTTTGCG GCCGGGCAGT GGCGCGTTCA GGTCCTGGAC GTTGGCGCCG GGCAGGCCAC CATCGTCGAG ACCCGACGAC ACGCGCTGTT AATTGATGCG GGTCCGGGCC GGGAGGGCGG CTTCAGCGCG GGTGACCGGA TTGTGGTGCC GGCCTTGCGG GCGGGCGGCA TCCGCTCCCT GGGCACGCTG ATGGTGACGC ACGAGCATGC CGGGCACGCC GGCGGCGTGG CGGCAGTCCG GGAACAGATC 
TCGGTGAGGC GCACGCTGCG GCGCACCCCG GTGAGCCATG GCACTGACGA ACGTTGCGAA AAGGGGGATC AGTGGAACTG GGACGGTGTG CGCTTCGAGG TACTCCATCC GCCGCCGGGC TGGAATGACG ACTCGTCCGC GTCCTGTGTG CTGAAGGTGG AGGGGGCGGA CGCGACCCTG CTGGTCATGG GCGGGCTGGA TGGTCTCGGC GAGGCCGTGA TGCGTCGGGG CACCGCAGGA TTTGCGGTTG ACTTGCTGGT AGCGCCGAGG ACTTCCAACC CGCGCGCCCT GCAGGACGAC TGGCTGGCGC AATTCCCGCC CGGTCAGGTT TGGGCTGCGA CGACTGCCGA GGGTGCCGGT TTGCCCGACG AGGCACGGAA ACGGCTGGAG CTAAGCGGTA TTCCCCTGTA CGAGACTGGG CGCAGCGGGG CGCTGGTCAC GAGCAGTACC CAACTCGACA GCGCTCCGGC GACAGGCCGT CCGCGGCTGC GATTCTGGCA TCCAATAAGC CCTGACCGAA ATGAGCCCTG A
|
Protein sequence | MMVAVLGLIA GVWGLHQLPV LPVLLEGPLR VFLPVAGALV ALGWVQAARR GSPFTWLPAA SLGLLAGVWL ALPHATDWSG KVVGPEWSGV EVSVRAQVVS LPEHGERRSR FRVGVSEIEQ GSPGLELEGR ELWVSTFPAR PAIQVGDSLR LDLRLRGVDG PRNPGGFDAA GWFYREGMHG SAVARSVTPL AGVHEGARGR VVMHRLRAAM QARLERAAPD LRHPGLVQAL VIGNRQAMTE SEWQAFLHTG TNHLMAISGL HVALVAGFAG GIAGWLWSGV AVIRRLRRWL FMGVVGLVAA TGYAVLAGFS IPTQRALLML IALTLAVLSR REGVAWRALA LAAAMVLIVH PPSVLAPGFW FSFGAVAVIL LLLQGRVGTT GWREGLRIQV VLAIAMLPLS LAWFQLGSWV APVANLVAVP MVTLLILPLL LVGALLAWGW PAAGGLLLGI ADGLLALLVM TLEALAGIPV LVDQRHVPVA ASLLGGVAVF LALQPLARRL APWIAVAAIA LVAPLRPDFA AGQWRVQVLD VGAGQATIVE TRRHALLIDA GPGREGGFSA GDRIVVPALR AGGIRSLGTL MVTHEHAGHA GGVAAVREQI SVRRTLRRTP VSHGTDERCE KGDQWNWDGV RFEVLHPPPG WNDDSSASCV LKVEGADATL LVMGGLDGLG EAVMRRGTAG FAVDLLVAPR TSNPRALQDD WLAQFPPGQV WAATTAEGAG LPDEARKRLE LSGIPLYETG RSGALVTSST QLDSAPATGR PRLRFWHPIS PDRNEP
|
| |
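The gene sequence above is printed in 10-nucleotide blocks separated by spaces, so it has to be cleaned before the GC content field can be recomputed from it. The following is a minimal sketch of that cleanup and calculation. The helper names are hypothetical, and the short example string is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-nt blocks of the gene sequence, used here only as a small example.
example = clean_sequence("ATGATGGTCG CGGTGCTGGG")
print(len(example), gc_percent(example))
```

Running the same two functions on the full gene sequence field gives a value to compare against the reported GC content; the cleaned length should also match the Gene Length field.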