Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4710 |
Symbol | |
ID | 4070649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 5570208 |
End bp | 5572022 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637986755 |
Product | integrin-like protein |
Protein accession | YP_593784 |
Protein GI | 94971736 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGATCGAT TCCGTCGCCT ACTCTTCATT TTGCTCACGT TCACTGGCGT TGCTCTCGCC GCCGTCGGCA GCAGCACCAC CACAACTCTG AGCATAACCA GCGCCGGGGC ACCGGTGGCT TACGTCTCCG CACCTTCCAC GATCAAGCTC ACGGCGACAG TGAAATCCTC CGCCTTGGCC GTTTTTCCCG GGCTGGTAAA GTTCTGCGAT AAATCCATCA ACGCTACCTG CAGCGGAAAT GCGTTCCTGG GAACCGCGCA GCTCACGTCC TCCGGCAAAG CATCGCTCGC AACCCGTTTC GGATATGGCG CGCATTCTCT GCAGGCAATC TTCGTCGGCA CGCGCTTCTA CTTCAGTTCG CAGTCCAGCG CGGCTTCGTT CTCGCAGAAG TGGAAAACAG GAGCCTCGCT CTCGCACGTC AATTCCGTCG CTGGCGCAAA GCCTGGAACG TTTGACCTCT CTGTCACTTT GACGGGCTTA CAAGCCGCAG CGCTGCCGGC TGCACCTACA GGCTCGATCT CGTTTCTTGA TGCCAGCAAC GGCAACGCGG TCGTAGCCAC TGCACCGTTA GGCGGCGGTG CCGGGGGAAC ACTCACAGTT CGCCAGACCG ACCAGCCTCT TGTCGGCGCG ATGCCATGGT CCGCGATGAC CGCCGACTTC AACAACGACG GCATCCCCGA CGTTGCCGTG CACAACGGCG AAGGCACGCT CTCCATCCTC CTCGGAAATG GTGACGGAAC CTTCCAGCCG CAAGCGAAAA TTTCCGGGAC GTTCATCCCC GAACCCCAAG CTCTCGCTAT CGCTGATTTC AATTCGGATG GAAATGCGGA CCTTGCATAT GCCACCACGG TTGGCGTCTC CGGTGGATAC TCAATCCTCT TCGGCAATGG CAATGGAACC TTCCAGGCAC CTGTTACCAC CCCCCTTCCG GGCGGCGGGA CGTGGGGCTT GGCCGCGGCT GACTTCAATC GCGACGGCAT TCCCGACCTC TTCGCGCTCC CACTCGGCCC CAGCGGTACA CCCATTCCCA GCGGCTTCAT GATCCTGATC GGAAATGGCG ACGGATCCTT CGTTTCGTCA ACCCAGCACC CGGAGATGGG CCTTGTAATG TCATTCGCGA CCGCCGATTT CAACGGCGAT GGCATTACCG ATCTCGTCGT CGGCTACGGA ACGTTCGGCA ACACGCAACT CGGCGTCCTG CTCGGCAATG GCGATGGAAC CTTCCAGGCA CCCGCATCAC CCGTTGCTTT GGGGAACTCG TATAGGTTTG CAACTGCCGA TCTGAACCAC GACGGGAAAC TAGACTTGGT CGTCGCGGGG GCACAGTCTG CATCGGGCTA CTACGGCGCT TATTCCTTAC TCGGAAATGG CGACGGCTCG TTCCAAACCC CGGTTGCCAT TGACACAACG TCCGTCTTTC CGGCTTCTCT CGCTGACTTC AACTTCGATG GCATCCCCGA CGTCGTGATC GGACACACCG ACTATCACGG CTCGATCGAA GTACGGCTCG GCAATGGAGA CGGTTCATTC CGTGCAGGCG TCACGCTGCC CACCGGAGCC TACCCGGTTC AACTCCCGAT CGTTGATCTC AACGGCGACG GCAAGCCCGA CATCCTCGTG ATTAATCAGG GATCGAACAA CTCTCATCCC GAGCGGCTGA ATGTCCTGCT GGACTGGTGG GGCAATCCGT CGAGTGTCAC CGCGAGCAGC GTAACCATCA CGGGCGCCGG CACCCACAAC ATCAAAGCGT CGTTCGCTGG CGACACCAAT TACCGTCAGT CGACTTCGGC GACAACCGCT GTCACCGTAC CCTAG
|
Protein sequence | MDRFRRLLFI LLTFTGVALA AVGSSTTTTL SITSAGAPVA YVSAPSTIKL TATVKSSALA VFPGLVKFCD KSINATCSGN AFLGTAQLTS SGKASLATRF GYGAHSLQAI FVGTRFYFSS QSSAASFSQK WKTGASLSHV NSVAGAKPGT FDLSVTLTGL QAAALPAAPT GSISFLDASN GNAVVATAPL GGGAGGTLTV RQTDQPLVGA MPWSAMTADF NNDGIPDVAV HNGEGTLSIL LGNGDGTFQP QAKISGTFIP EPQALAIADF NSDGNADLAY ATTVGVSGGY SILFGNGNGT FQAPVTTPLP GGGTWGLAAA DFNRDGIPDL FALPLGPSGT PIPSGFMILI GNGDGSFVSS TQHPEMGLVM SFATADFNGD GITDLVVGYG TFGNTQLGVL LGNGDGTFQA PASPVALGNS YRFATADLNH DGKLDLVVAG AQSASGYYGA YSLLGNGDGS FQTPVAIDTT SVFPASLADF NFDGIPDVVI GHTDYHGSIE VRLGNGDGSF RAGVTLPTGA YPVQLPIVDL NGDGKPDILV INQGSNNSHP ERLNVLLDWW GNPSSVTASS VTITGAGTHN IKASFAGDTN YRQSTSATTA VTVP
|
| |