Gene Information |
Locus tag | Acid345_4266 |
Symbol | |
ID | 4073193 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 5068221 |
End bp | 5069810 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637986298 |
Product | hypothetical protein |
Protein accession | YP_593340 |
Protein GI | 94971292 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | |
|
|
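The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information table.
start_bp = 5068221
end_bp = 5069810

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which encodes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1590 529
```

Both results agree with the Gene Length (1590 bp) and Protein Length (529 aa) fields in the record.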
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTATCAT GCGCTCGCGC CCGGAAGCCA TGTGCATATC GGATGGCTGA AATTGAAATG GGCTCATGTC CAGCGATCAT GGATTGTTCC GCATTCGAAC TGCCGCTCCC TCCCTTTGCT CCGGTCTTGC TGGAGTCAAT GCGCGCCATT GGGTATTCGT TCGAATCTGC AATCGCCGAT GTAATAGACA ACTCAATCTC CGCCGGTGCC CAGAACATTC AAATACGCTT TCTGCCTTAC GACGAGCCAT TCGTCTCCAT TCTCGATGAT GGAATCGGGA TGTCCGCGCG GGCGCTCGTC GAGGCGATGC GTCATGGCAG CCAAGATCCG CGTCTCCTCC GTTCCAGCTC GGATTTAGGT CGATTCGGGC TTGGGTTGAA AACCGCATCA CTGTCTCAGT GCCGCTGCCT CACCGTAATT TCGAAACAGA ACGGCGAGCT TAATGCGAGA CGTTGGGACT TGGACCTTGT GGAGAAGAGG AAGGATTGGA TTCTCACAGG GGTTCGACCA GACGAGCTTA GCACTTTGCC GCAAGTCGCA GAGCTTGCGA AATTGGAGCA CGGCACGCTC GTCCTGTGGC AACGCTTTGA TCGCCTCGCG GCTGGCGAGT CCTCAATCGA ACAGGCACTT GGCGATCGGA TTGACATGGC GAGAGAACAT CTGTCGCTGG TCTTTCATCG CTTTCTCGCA TCAGAAATCA AGGGCTCCCC GACCTTGGAA ATTGCTATCA ACGACAACCC CCTCAAAGCT CAAGATCCAT TTCTCCGCGC AAACAAAGCC ACTCAGTCGC TCCCGGAAGA GTCTCTCGCT GTGGAGGGAT GTTCCGTGAA AGTTGCGCCA TTCATCCTGC CCCATATCTC CCGACTTTCT AAGGACGATT TAAGAATGGC CGGTGGCGAA GAAGGGCTTC GAAGAAATCA GGGGTTTTAC GTTTATCGGA ACAAACGACT CATCAGTTGG GGTTCTTGGT TTAGGCTCGT CCGCCAAGAA GAAATGACAA AGCTGGCGCG AGTGCGGGTT GACATTCCAA ATGCACTTGA TCACTTGTGG ACTCTCGACG TAAAGAAATC CGCTGCTTCC CCGCCAGAAG CCATAAGGAA TGGGCTCAGA GTCATCGTGG ACAGAATTTC CGAGGGAAGC CGCCGTGTCT ACACATTTCG CGGTCGCCGC GCCAACGCTG ATGGAGTAGT GCACATTTGG GACCGCACTC TCGAACGAGG CGGAGTCACG TACACGCTGA ACCGGGAACA CCCTCTTATT ATTGCGCTGG AAAGTTTGAT TCCCGATACC GCGTTGCCGC TATTCCAGAA GCATCTTCAA TCGGTTGAGC GAACCTTCCC GTTCGACTCG CTTTATGCAG ATATGGCTTC CGAACGCCGC CCCGACCCAC CGGAGAGCCA TACCAGGACT GACGAAGAGC TGTATGACTT GGCAAGCCGT CTTCTTGATG TCGTGGGAAC TGATCCGGCG TCGACGTCAC GATTCTTGAG GAGCCTTGCA ACCATGGAAC CGTTTAGCAG ATATCCCGAG AGTATCGAGA CGCTGACCGA GAAATTAGAA TATGTCCATC GACAACGCCC GGCTGATTGA
|
Protein sequence | MLSCARARKP CAYRMAEIEM GSCPAIMDCS AFELPLPPFA PVLLESMRAI GYSFESAIAD VIDNSISAGA QNIQIRFLPY DEPFVSILDD GIGMSARALV EAMRHGSQDP RLLRSSSDLG RFGLGLKTAS LSQCRCLTVI SKQNGELNAR RWDLDLVEKR KDWILTGVRP DELSTLPQVA ELAKLEHGTL VLWQRFDRLA AGESSIEQAL GDRIDMAREH LSLVFHRFLA SEIKGSPTLE IAINDNPLKA QDPFLRANKA TQSLPEESLA VEGCSVKVAP FILPHISRLS KDDLRMAGGE EGLRRNQGFY VYRNKRLISW GSWFRLVRQE EMTKLARVRV DIPNALDHLW TLDVKKSAAS PPEAIRNGLR VIVDRISEGS RRVYTFRGRR ANADGVVHIW DRTLERGGVT YTLNREHPLI IALESLIPDT ALPLFQKHLQ SVERTFPFDS LYADMASERR PDPPESHTRT DEELYDLASR LLDVVGTDPA STSRFLRSLA TMEPFSRYPE SIETLTEKLE YVHRQRPAD
|
| |
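As a rough sketch of how the gene and protein sequences above relate, the snippet below translates the start of the gene sequence and defines a helper for GC content. The function names `translate_cds` and `gc_content` are hypothetical and not part of any database tooling. The codon table is the standard one, which translation table 11 shares; the set of start codons is limited to three common ones as a simplifying assumption.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: only these common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq):
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    # An alternative start codon such as GTG is read as methionine at position 1.
    if protein and dna[:3] in START_CODONS:
        protein[0] = "M"
    return "".join(protein)


def gc_content(seq):
    """Return the percentage of G and C bases, ignoring whitespace."""
    dna = "".join(seq.split()).upper()
    gc_count = sum(base in "GC" for base in dna)
    return 100.0 * gc_count / len(dna)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "GTGTTATCAT GCGCTCGCGC CCGGAAGCCA"
print(translate_cds(first_blocks))  # prints: MLSCARARKP
```

The output matches the first ten residues of the protein sequence in the record, with the GTG start codon read as methionine. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.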