Gene Information |
Locus tag | Acid345_1690 |
Symbol | |
ID | 4070473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2050058 |
End bp | 2051746 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637983698 |
Product | integrin-like protein |
Protein accession | YP_590765 |
Protein GI | 94968717 |
COG category | |
COG ID | |
TIGRFAM ID | |
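The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the reported protein length excludes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for Acid345_1690.
length = gene_length(2050058, 2051746)
print(length, protein_length(length))  # prints "1689 562"
```

Both results agree with the Gene Length (1689 bp) and Protein Length (562 aa) rows.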
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0343612 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAACCGGC GCGGGTTCCT GTATGGCATG GGCTTGTTAG GCCTCGCGCA GTCTTTACCA ACTTTTGCGC AGACCGCGCC GAATCCCGGG TTCAAACTCG TGGATGTCAC AAAAGCCGCA GGGATCGACT TTCGACACAA CAACGGCGCG TATGGCGGGA AACTGCTTCC CGAGACACTC GGTCCGGGTT GCGCCTTCCT CGATTATGAT GGCGACGGTT GGCTCGATGT TCTCCTGGTC AACGGCACGG ATTGGCCCGC TCATCGTAAA CAGCGGTCGA CTCTCAAGCT CTACCGCAAC AATCGCAACG GAACTTTCGC TGACGTCACC AAGGCCGCTG GCCTCGATGT CGAGATGTAC GGCATGGGCG TGGCGGTCGC CGACTACAAC AACGATGGCT TACCCGACAT TTTTGTTACC TGCGTAGGCC AGAGCAGGCT CTTCAAAAAC ACTGGCAAGG GGACGTTCGT TGACGTCACC AAGGCCGCCG GTCTCGACAA ACGGACGGCA CTCAGCACCA GTGCTCTCTG GTTCGATTTC GATCGCGACG GGTTCCTCGA CCTGTTCGTC TGCAACTATG TGAAGTGGTC GCCTGAAACC GACATCTCTT GCAGCATTGA TGGCAAACAT AAGTCATACT GCACACCCGA AGCGTATCGC GGCGAAACCT GCTGGCTGCT GCGCAACCGT GGCGATGGAA CCTTTGAGGA CGTCACCGCG AAGAGCGGGA TATTCGACTC AAGCTCGAAA TCTCTAGGCG TCGCGATGCT CGACAGTAAT GGCGATGGCT GGCCGGACCT GGTTGTTGCT AACGATACCC AACCCAACAA GCTCTATCGC AACTTGAAGA ATGGGAAATT TCAGGACGTT GGAGTAGAGA CGGGCCTCGC GTTCAGTATG GATGGCAAAG CGCGCGCGGG CATGGGCATT GACGTTGGCG ACTTCGAGAA CTCCGGCCGA CCCGGCATTG CGGTTACGAA CTTCGACAAC GAGATGATCG GACTATACAA GAGCGATGCC GCCGGCAACT TCAACGATGT CGCAGTGACG GCCGGCGTCG GAGGGCCTTC GCGACGCATG CTCGGCTTCG GCTGCGCATT CGCAGACTTA AATCTCGATG GAGCGCTGGA TCTGATCGCC GTCAACGGCC ACATTGATGA CACTGTTCGT GCCGTTCGGC GCGACGTGAA CTTCGAGCAA GCGCCGCAGA TTTTCCTCAA CGATGGCCAC GGAAAATTCC GCGACATCGC TGGAAGCATC GGCGCAGACT TCGCCACGCC TAAAGTCGCA CGAGGCCTCG CGTACGGAGA CTTTGACCGC GACGGCGACC TCGACCTCCT GCTTACGACC AATGCCGGCC CGGCCTACTT GTACCGCAAC GACCTCAGCA GCGGTAACCG GAGTATTCGC TTCCATCTCA CAGGAACGAA GTCAAATCGC GATGCAATCG GCGCAACAGT GAAGGTAACG TCCGCTGGCG TTACGCAATC GCGCATTGTG CATAGCGGTT CGAGCTACCT CTCGCAATCG GAGCTGGCGT TGACGTTCGG GGTGGGCCAG CGCGACAAAA TCGATCGTGT CGAGATCTCG TGGCCGAGCG GACGCACGGA AGAGTTCAAG AATCTCGCGA GCGGCAAAGG CTACGCTGCC GTAGAAGCCA AAGGCTGCAA TCCACTCGAT CATTTCTAA
|
Protein sequence | MNRRGFLYGM GLLGLAQSLP TFAQTAPNPG FKLVDVTKAA GIDFRHNNGA YGGKLLPETL GPGCAFLDYD GDGWLDVLLV NGTDWPAHRK QRSTLKLYRN NRNGTFADVT KAAGLDVEMY GMGVAVADYN NDGLPDIFVT CVGQSRLFKN TGKGTFVDVT KAAGLDKRTA LSTSALWFDF DRDGFLDLFV CNYVKWSPET DISCSIDGKH KSYCTPEAYR GETCWLLRNR GDGTFEDVTA KSGIFDSSSK SLGVAMLDSN GDGWPDLVVA NDTQPNKLYR NLKNGKFQDV GVETGLAFSM DGKARAGMGI DVGDFENSGR PGIAVTNFDN EMIGLYKSDA AGNFNDVAVT AGVGGPSRRM LGFGCAFADL NLDGALDLIA VNGHIDDTVR AVRRDVNFEQ APQIFLNDGH GKFRDIAGSI GADFATPKVA RGLAYGDFDR DGDLDLLLTT NAGPAYLYRN DLSSGNRSIR FHLTGTKSNR DAIGATVKVT SAGVTQSRIV HSGSSYLSQS ELALTFGVGQ RDKIDRVEIS WPSGRTEEFK NLASGKGYAA VEAKGCNPLD HF
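The relationship between the two sequences can be sketched in a few lines of Python. This is an illustrative sketch, not the database's own pipeline. It assumes the gene sequence is listed 5' to 3' on the coding strand (it starts with TTG and ends with TAA even though the gene is on the minus strand), that the spaces between 10-base blocks are formatting only, and that the first codon is read as methionine, as is usual for alternative start codons such as TTG under translation table 11. The helper names `clean`, `translate_cds`, and `gc_fraction` are hypothetical.

```python
# Codon assignments shared by the standard code and translation table 11,
# listed in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding-strand CDS up to, but not including, the first stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    residues = []
    for index in range(0, usable, 3):
        amino_acid = CODON_TABLE[dna[index:index + 3]]
        if amino_acid == "*":
            break
        # Assumption: the first codon is a valid start codon and is read as Met.
        residues.append("M" if index == 0 else amino_acid)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
print(translate_cds("TTGAACCGGC GCGGGTTCCT GTATGGCATG"))  # prints "MNRRGFLYGM"
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content row.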