Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3800 |
Symbol | |
ID | 4071084 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4490244 |
End bp | 4491641 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637985823 |
Product | hypothetical protein |
Protein accession | YP_592874 |
Protein GI | 94970826 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACCTTA CACAGACAAG GCTGCAGAAC GTCTGGCTGA AAGCGCTAAT GGCTTGCTTG GTAGGTTATG CCCTACTAGG CAAGGGTTTC GCATACCTTT TCATAGGGGA ATGCGTTCTC GTTGCTGGGT TCGCGATATT CTTACTTTCT CGCAGAGCGA CGTTAGTGGC GAGCGATAGT GTGCTTTTAC TCTGGGGACT CTTCGCCTTT TGGGGGGCCT GCCGAACTAT CCCATTTCTA TCGACGTACC ACTTCGATGC GATCCGAGAC GCAGTCCTTT GGGGGTACGG TCTTACCGCG CTGTTCATTG TGGCGTCCGT CAATAGCGCG GAGCAAATTG CGAGTGGGCT TAACGCCTAC CGCAAGTTCC TTCGCTGGTA CATGCCATTA TTGCCAATTA TCCTCCTGCT ATCCGGTCCT TTGAGGCCGC TGATGCCCGT TGTTCCGTGG AGTCGAGACG CTGCCATCGT TATGTTGAAG CCCGGTGATG CAAGTGTCCA TCTCGCGGCA GCTGCACTTT TCTGGTTGAT CCTTGAGAGG CAGAGCTCCG CGCGGAAGAG AAGAGGTTTC TCGGCCATGC AAGGCGTTGC TATAGCCGGA TGGTTTGGCG CTACGATGTT TGTACTAGTC AGAACTCGTG CTGGGGTTCT TGCCATCATC ATTCCGATGG CCCTGGTTTC ACTTCTGAAA TTGCGAAGAG TAGCCTGGAA GGTTGGCGTA TTTGCCGTTG CAGGAACTTT CCTGCTGGCA ATGATCTTGG AATCGAATCT GATCCAGATC AATATACACG GCCGTAAATT CAGTTCGGAG CAGATTACAA ACAACCTGTA TAGCATAGCG GGCCAAGGAG ATGAGAAGAC CGATCTTGAG AATACAAAGG TGTGGAGACT TATCTGGTGG AGGCACATAG TTCAATACAC TGTTTTCGGC CCTTATTTCT GGACTGGAAA AGGCTTCGGA ATCAATCTGG TGTTGCAAGA TGGCCCGCCC CATGTAACGG AGGACGATAA GACAACGCGT AGCCCTCACA ATGGAAGTAT GACAGTGCTA GCGCGCATGG GAGTTCCTGG TCTCGTGATG TGGGCCTCGC TGAATCTCGT GTTTCTCTTC CGAATGCTCC GGGCCTACCG CCGCGCTGCC CGATCGGGAG CCCAGTTCTG GGCTTCGGTG AATCTATGGG TTCTATGTTA TTGGATTGCT GCTTTTATCA ACTTGAGTTT CGATGTTTAC ATCGAAGGGC CAGTGGGTGG AATTTTGTTT TGGTCGATTA TCGGCTTCGG CGTGTCCTGC CTGCGCGTAC AGAGCTATGA AGCGCGTCAG ATTGCGCACG GCCGAGTGAG AAATTTTCAT ACCAGGTCAG CAGAGCAGTT GGCGGTCAGG GAATTATCGC CATCCTGA
|
Protein sequence | MYLTQTRLQN VWLKALMACL VGYALLGKGF AYLFIGECVL VAGFAIFLLS RRATLVASDS VLLLWGLFAF WGACRTIPFL STYHFDAIRD AVLWGYGLTA LFIVASVNSA EQIASGLNAY RKFLRWYMPL LPIILLLSGP LRPLMPVVPW SRDAAIVMLK PGDASVHLAA AALFWLILER QSSARKRRGF SAMQGVAIAG WFGATMFVLV RTRAGVLAII IPMALVSLLK LRRVAWKVGV FAVAGTFLLA MILESNLIQI NIHGRKFSSE QITNNLYSIA GQGDEKTDLE NTKVWRLIWW RHIVQYTVFG PYFWTGKGFG INLVLQDGPP HVTEDDKTTR SPHNGSMTVL ARMGVPGLVM WASLNLVFLF RMLRAYRRAA RSGAQFWASV NLWVLCYWIA AFINLSFDVY IEGPVGGILF WSIIGFGVSC LRVQSYEARQ IAHGRVRNFH TRSAEQLAVR ELSPS
|
| |