Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2477 |
Symbol | |
ID | 4072101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2933234 |
End bp | 2934250 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637984494 |
Product | UBA/THIF-type NAD/FAD binding protein |
Protein accession | YP_591552 |
Protein GI | 94969504 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.509569 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAGTTCC AAGAACGGTA CTCCCGGCAA ATTCTGTTCC ACGGCATTGG GGCTGAAGGG CAGCAGAGGC TGGCCGCGGG ACGGGCAGTA ATCGTCGGTT GCGGAGCAAC CGGTTCGGCG CTGGCCTCCC TTTTGGCGCG CGCCGGCGTG GGCTATTTGC GAATCGTGGA TCGCGATTAC GTAGAGCCGA GCAATCTGCA AAGACAGGGT CTGTTCGACG AGAACGATGC CGCCGAGGCG CTTCCGAAGG CCATCGCGGC AGCGCGAAAA ATCCATGCGT TTAACAGCGA GATCACGGTT GAACCTCATG TGGATGACCT GACTCCCGAC AACGCCGACG ATCTACTCGC GAACGTGCAA TTGATCCTCG ACGGAACCGA CAACTTCGAG ACGCGCTATC TGATTAACGA CTACGCCGTT AAGAACGCCG TGCCGTGGAT CTACTCCGCG GCCGTAGGCA GCTACGGCGT GGCAATGAAT ATCCTGCCCG GCGAAACGGC TTGCCTGGCA TGCGTTTTCC CCGATTCGCC GCGCGGTGTG GTCGAGACGT GCGATACCTC AGGAATCTTG AACACCGCTG TGAACGAAGT GGCATCGCTC TCAGCGACGG AAGCGTTGAA ATATTTTGTC GGCGCCCGGG AGAAGATGCG ACGGACGCTG GTCTCAACCG ATGTCTGGAC CAATGAACGG TCGGAGATTC GGACCGGCGG ACCGAAACCT GGCTGCCGGT GCTGCGGGAA GCGCGACTTT AGCCACCTGT CCGGGGAGGG GCGTCCGCAT ATTTCCTTGT GCGGACGCAA TTCGGTGCAG ATCCATGAGC GGCAGAGGCC GATCGACTTC GCGATGATGG AGACACGACT GCGGCCGCAT GGCCAGGTAC GCCACAACGA ATTCGCGCTG CGGTTTTTCC ACGAACCTTT CGAGATGACC CTGTTCCCGG ACGGACGGGC GATCATTAAG GGGACGACGG ATATTGGCGT GGCTCGAAGT TTGTATGCGC GGTTCGTGGG ATCGTAG
|
Protein sequence | MQFQERYSRQ ILFHGIGAEG QQRLAAGRAV IVGCGATGSA LASLLARAGV GYLRIVDRDY VEPSNLQRQG LFDENDAAEA LPKAIAAARK IHAFNSEITV EPHVDDLTPD NADDLLANVQ LILDGTDNFE TRYLINDYAV KNAVPWIYSA AVGSYGVAMN ILPGETACLA CVFPDSPRGV VETCDTSGIL NTAVNEVASL SATEALKYFV GAREKMRRTL VSTDVWTNER SEIRTGGPKP GCRCCGKRDF SHLSGEGRPH ISLCGRNSVQ IHERQRPIDF AMMETRLRPH GQVRHNEFAL RFFHEPFEMT LFPDGRAIIK GTTDIGVARS LYARFVGS
|
| |