Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3532 |
Symbol | |
ID | 4069263 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4178806 |
End bp | 4180173 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637985555 |
Product | hypothetical protein |
Protein accession | YP_592607 |
Protein GI | 94970559 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCACCT ACACGCTGCT GCGACTCATC GCAGTGGCTT TCCTGGTAGC GCTCAACGCC TTCTTCGTAG CGGCGGAGTT CGCCCTCGTC AGCGTGCGCG ATACCAGGCT CCAGCAACTC ATCGACGCGG GCAAGATTGG CGCCCGCACC GTCGAGCGCC TGCACAATCG CCTCGACGAA GTTCTTGCCG CGGTTCAACT CGGCGTCACC ATCGCCAGTT TGGCGCTCGG CTGGATTGGC GAACTGGCGA TTGCGGTCAT ACTTGAACCG CATTTCGTCC ACCTGCCGCA TGGGCTTTAC TACGCACACG GGCTAGCGGC GACGATCTCG TTCACGATCA TTACCTTCTT CCACGTTACG CTCGGCGAAG TCGTGCCCAA GACATTGGCG CTGCAGCGCG CTGAACAAGT GGCGCTCGCG GTCGCGACGC CGATGGAAGT TTTCATCGCT GTCGCGCGGC CGCTGCTGGC GGTGATGCGC ATGGCAGCAC GTTTCGTTCT GCGTTTGTTC GGCACCAAGG AAATGCGCGA AGGCGGCGTG CACTCACCCG AGGAACTCAA GCTGATGGTG ACGGCGAGCC GCAAGTTCGG CCTTGTGCCG AGACTCCAGG AAGAAATGAT CAACCGCGCC ATCGATTTGG AAAATATCTC GGTGCGCGAG ATCATGGTGC CACGACCGGA CATCTTCTCG CTCCCCGGCC ACATGACGCT CGACGAAGCC GTGCAGCGCG TTGTGGACGA ACAACACTCG CGCATTCCGA TCTACGACGC CGAGCGCGGC CCCGAGCACA TTATCGGTGT GCTCTACGCC AAAGATCTCA TGCGCTGGAT GCGCTATCGC ATCGCGCGTC TCCAACAGAA CCGCCCAGCG CGTATCGCGT CGAATCTCAA GGTCCAGCAC ATCATGCGCG AGGTGCTCGT CGTTCCTGAG ACCAAGCCGC TCACCGACCT CCTCGAAGAA TTCAAAGAAC GCAAGCGGCA CCTTGCCGTC GTCGTCGATG AGTTCGGTTC GACCGCCGGC GTGGTTACGG TTGAAGATGT GCTCGAAGAA CTGGTCGGCG AAATCGAAGA CGAGCACGAC GTTCCCGAAG AATCGGCGCT CACTCCCGGG GGCACCACCT TGGTTCTCGA CGGCGGTATC AACATCCGCG ATCTCGAGTC GCAATACCAG GTGCGTTTGC CGCGCGACGA AGGCTTCGAG ACCCTTGCCG GCTTCGTCAT GACCCGGCTG CAACGCATTC CGCGCGAAGG CGACAGCTTC GCCTTCCACA ACTATCGTTT CACCGTGCTC GAGATGGAAG GCCGCCGCAT TGATAGCGTC AAACTCGAAC TGATCCAGCA AGCCGAAGAA CTGGAGCAGC CGACCTAA
|
Protein sequence | MVTYTLLRLI AVAFLVALNA FFVAAEFALV SVRDTRLQQL IDAGKIGART VERLHNRLDE VLAAVQLGVT IASLALGWIG ELAIAVILEP HFVHLPHGLY YAHGLAATIS FTIITFFHVT LGEVVPKTLA LQRAEQVALA VATPMEVFIA VARPLLAVMR MAARFVLRLF GTKEMREGGV HSPEELKLMV TASRKFGLVP RLQEEMINRA IDLENISVRE IMVPRPDIFS LPGHMTLDEA VQRVVDEQHS RIPIYDAERG PEHIIGVLYA KDLMRWMRYR IARLQQNRPA RIASNLKVQH IMREVLVVPE TKPLTDLLEE FKERKRHLAV VVDEFGSTAG VVTVEDVLEE LVGEIEDEHD VPEESALTPG GTTLVLDGGI NIRDLESQYQ VRLPRDEGFE TLAGFVMTRL QRIPREGDSF AFHNYRFTVL EMEGRRIDSV KLELIQQAEE LEQPT
|
| |