Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0655 |
Symbol | |
ID | 4069747 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 807027 |
End bp | 809276 |
Gene Length | 2250 bp |
Protein Length | 749 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637982661 |
Product | hypothetical protein |
Protein accession | YP_589734 |
Protein GI | 94967686 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.792634 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTCTT CTGATCATGA CGTCAGCCGT CGCACATTTC TAAAGTTCGG GGCCGCTGGA ACGACCTTGG CCGCTCTTAC TCCCACAGCG GTCGCGAATC CCGGGTTCTT CGGCACCGAT CCAATCGTCA AAAAAAGGCT GACGTGCCCG CCATCGCTTG CCGACATGGC CTCCGATCCG CAGCGCTATC AATTCCGAGA TCTTTTCAAT TCACCGGCGG CGATGAACGA GTTTGGGTAC GCGCAGGTTG GCAAGTCGGT CTCGGCAATC ACCGCGATCT CATTTCCTCC GTACGCATGC TGCGCTCCGC CGTCGATGCC CTGGAGCCCC GGCTATCTTC TCACTTGCGA ACTGTTCCTC GACGGCCGGT TTGCCGCGAT TGCTCCGGAG CCGGAAGGCG TCGTGGAGTA CCAATGGTTC CCGCATTGCG TGTTTCGCAA CCAGACCATG GGGGGACTCC AAATCTCAAC CCGTATGTTC CTGCCCAGCA AGCAACGTGC GGTGATGCAA ACGATCACGG TAAAAAACGC CAGCAGTGGC CGCAGAACGT TCACGCTTGG TTTCGACATG CGTGGTGCGG TCGCAAAGCA GACGACGCCA TGGTTCGTGA ATTCTCCCGG CGAAGCCGAC AACAAGATCA CATATGACGC GCGACGCGGC TGCCTCATCT TCGAAGCGCA GCACTCACAG ATCGTCGGTG TGCAGGGTTT TCATACCTCT CCCGACCGGG TCGAGCAGAA GCGAATGCTG CTTTTTGACC TCGAGCTTGG TTCCAGGGAA TCAAAATCTC TGAATTTTAC GGTCGCGCTT GCGGGCGACG CTTCGAGCGC GGTGGAACTC TATGACAAGT TGCAGGCTAA CTTCGCAGGG ATCGAGAAGG AGAGTGAAGC GACCTTCGAC CATCTGGTGG GTTCGGCGTT CACACCCGGC AATTCCGAAT TCAGCGGAAA CCTGCCGCGC CTTGTCACCG ACAACGAAGC TCTCTGGAAG CTGTACCATA ACGGTTTCGC CAACCTACTG TTTGCAAGGC GCGTATCGCC CGATTCCGTG TACGGTCCAA CCTATCTCAC GCTCAGTGGA CACGTGCTGC CGACCCTGAG TTTTCCGTGG GACACTTCGC TCACCTCTCT TGCACTAGCA TTGCTGGACC CGACGCCTCT GCGCACGCTT GTCGAGGCAT GGCTGAAACT CGGTTTGCAC GACCATCACT CCAGCGACTA CATCAGCGGG CAAGGTGCAG GGCCGTGGTA TGCGGTGAAC GATACCGCCA TCGTTCGCTG CGCTTGGCGA TACATCTGCG TTACCGGCGA TTTCGCGTGG CTCGACAAGA AAATTGGCGC TCACTCTGTA CTCGAAGGCC TCGAAGAGCA CGCACTCTAC TGGAAGAAGC TCGACCCTGC TGGTCACGGA CTCGGAGATT ACGGCACGAT TGAGAATCTT CTGGAAGTTG TGAGCACGTA TCTCCACGAA GTTGCCGGTA TGAACGCGAA CAATGTACAC AGCATGCGGG TTGTCGCAGC AATGCACGAG CATCGTGGAA ATCAGGTGCG CGCACAACAA CTACGGGCGG AAGCGAAGTC GCTGGCCGAG CGCATCATCC ATGACCTTTA CGTCGCCGGA AAAGGGTATT GGCGCTGTCG CCAGCCAGAT GGCTCGTTCA ACGAAGTCCG TCACTGCTAC GACTTCCTTG CTGTCCTCGA CAATATGGCC GAGGACCTAT CGCCCGCTCA GAAGCAAGAG ATGGCCGCAT TCTTCTGGAG GGAGTTGCGC AGCGAAACCT GGATGCGCGC ATTGTCTCAA AGCGACGCCG ACGCCACTTG GAACATCCGT CCCGACCATA GCTGCCTCGG TGCGTACGGC GCGTGGCCGG CTATGAGTGC GAAAGGTTTG TACAAGGCCG GCAGCTCACC GAAATTGTCG GCGTGGCTGA AGCAGGTCGC AAAGGCAGGG AATCAAGGCC CGATCGGGCA AGGCCATTTC ATCGAAGACG TTTTTCCGCC CGTGAATGGC GGCGCGAGAA AAGCTTCGGA AGATGCGCCA TACATCGAAG ATTGGTGCTG CATTGCGGCG GGTTCGTTTA CCGAACTTGT AATCGATTCG ATCTTCGGAG CGGAGCTGAC GATGAAAGAT GGTATCCGAG TGAACTCGCG TCTAGAGGAT TTCGATCCCA ATGCGCGGCT CGAAGGCCTG CGTTATCAAG GCTCGCTCTA CAGCATCACG AAGAATGGGG CGCAAAAACA AACCGGATAA
|
Protein sequence | MPSSDHDVSR RTFLKFGAAG TTLAALTPTA VANPGFFGTD PIVKKRLTCP PSLADMASDP QRYQFRDLFN SPAAMNEFGY AQVGKSVSAI TAISFPPYAC CAPPSMPWSP GYLLTCELFL DGRFAAIAPE PEGVVEYQWF PHCVFRNQTM GGLQISTRMF LPSKQRAVMQ TITVKNASSG RRTFTLGFDM RGAVAKQTTP WFVNSPGEAD NKITYDARRG CLIFEAQHSQ IVGVQGFHTS PDRVEQKRML LFDLELGSRE SKSLNFTVAL AGDASSAVEL YDKLQANFAG IEKESEATFD HLVGSAFTPG NSEFSGNLPR LVTDNEALWK LYHNGFANLL FARRVSPDSV YGPTYLTLSG HVLPTLSFPW DTSLTSLALA LLDPTPLRTL VEAWLKLGLH DHHSSDYISG QGAGPWYAVN DTAIVRCAWR YICVTGDFAW LDKKIGAHSV LEGLEEHALY WKKLDPAGHG LGDYGTIENL LEVVSTYLHE VAGMNANNVH SMRVVAAMHE HRGNQVRAQQ LRAEAKSLAE RIIHDLYVAG KGYWRCRQPD GSFNEVRHCY DFLAVLDNMA EDLSPAQKQE MAAFFWRELR SETWMRALSQ SDADATWNIR PDHSCLGAYG AWPAMSAKGL YKAGSSPKLS AWLKQVAKAG NQGPIGQGHF IEDVFPPVNG GARKASEDAP YIEDWCCIAA GSFTELVIDS IFGAELTMKD GIRVNSRLED FDPNARLEGL RYQGSLYSIT KNGAQKQTG
|
| |