Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3397 |
Symbol | |
ID | 4072733 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4021732 |
End bp | 4022973 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637985419 |
Product | hypothetical protein |
Protein accession | YP_592472 |
Protein GI | 94970424 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.117758 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCGCCGGG GCCCCCTTCA AGACCTGCTG CGACGCACAA AACGGTTCAA ACGAGGAAGG ATTCGCAACC GGCGTGTCTC TCTTCAACGA CGCTTACGGC GGCGGAAGAA TCTGGTGCCT CAGTTACGCG TTGTTATTCC TCCGAGCTTC TCGATCCTCG ATGATCCCGA ATCGAACCTC GCTTTCATAT CGCGATTCCG AAGCCAGCTT TCCGCCAAGA TGTACAGGTC TGTTCACTTC GATCACTCAG GGTGCAAGAA CTTGGGAATG GATGCGCAGG CGATTGTGGA TGTCCTCGTT GCCGAGGAAT TGGCTAGGCG ACCGAACGGA ATTGGAATAG GTGGAGACTT TCCCCGCGAC GCTCGGACGA ACGTAATGCT GCGAGCGATC GGAACGTTAC GCCAGTTCGG ACATCCCGAA ATGAAGCTGG CTCCAGAAGT CGAGAGTCGC ATCGAACGCT GTGACCGCGT CAATGGCAAC GGACACAACC TCAAATACAG TTCTGAGCGG GACCGTGCTG CACTAGCATT GGTCACGTAT GTTGAGCGCG TGCTCCAACA TCAGTCCTTC ATGCTCACCT TCGAAGGTCG TTCCGACTTG AGCAGCATAA TCACGGAGGT AATCGGCAAT GCCGAAGAAC ACAGCGGGCG CTGGTATGCC GTCGCGTTCT CGCAGCCGGG CATCGTTGAA CAACAAGGCA TGCCTGAACC CGAGGAATGT CAGATGGTCC TCTTTAATTT TGGCCGCTCC ATTTATGAGT CCTTGGTTTC GAGAGGAGCA TCTACCTATG TGAAGGAGAG GATCTCAGCC TTGGCAAATG AGCATCGTCA TTCCAGGCAG TTTTCCGACA GTTGGACAGA GGAAGATCTC TGGACTCTGG CAGCTCTCCA GCAAGGTGTC AGTCGATATC GAACCGACGA AAAGGGGAAG ACGCGCGGAA ATGGGACGAT TGAACTCATA CGCGCCTTTT CCGAGCTATC CGATGTACCC AAAAAAATGT GCGTCGTTTC AGGGCACACG TATATACTCT TCGACGGAAG CTACAAATTG CGCGCCGACT CCAACGGTTT GCAAATGATT GCCTTCAATA CATCGAACGA TTTGGAAAAA CCTCCAGACC CACGGTACGT TCGTCACTTG AAACACGGGT TCCCAGGCAC GATCATCAGT ATGCGATTTG TGATGGATTC CAGATACCTG GAATCGCGGA TTCAAAGCAA TGGCTCATCA GAACGTAATT GA
|
Protein sequence | MRRGPLQDLL RRTKRFKRGR IRNRRVSLQR RLRRRKNLVP QLRVVIPPSF SILDDPESNL AFISRFRSQL SAKMYRSVHF DHSGCKNLGM DAQAIVDVLV AEELARRPNG IGIGGDFPRD ARTNVMLRAI GTLRQFGHPE MKLAPEVESR IERCDRVNGN GHNLKYSSER DRAALALVTY VERVLQHQSF MLTFEGRSDL SSIITEVIGN AEEHSGRWYA VAFSQPGIVE QQGMPEPEEC QMVLFNFGRS IYESLVSRGA STYVKERISA LANEHRHSRQ FSDSWTEEDL WTLAALQQGV SRYRTDEKGK TRGNGTIELI RAFSELSDVP KKMCVVSGHT YILFDGSYKL RADSNGLQMI AFNTSNDLEK PPDPRYVRHL KHGFPGTIIS MRFVMDSRYL ESRIQSNGSS ERN
|
| |