Gene Information |
Locus tag | Acid345_3947 |
Symbol | |
ID | 4071330 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4668019 |
End bp | 4669422 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637985973 |
Product | hypothetical protein |
Protein accession | YP_593021 |
Protein GI | 94970973 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3420] Nitrous oxidase accessory protein |
TIGRFAM ID | [TIGR03804] parallel beta-helix repeat (two copies) |
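
The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 4668019
END_BP = 4669422
GENE_LENGTH_BP = 1404
PROTEIN_LENGTH_AA = 467


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks print True for the values in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span 4668019 to 4669422 covers 1404 bp, which corresponds to 468 codons: 467 amino acids plus the stop codon.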
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.917121 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.453465 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGTGAAA GCAGAAACGC AATGGGCCGA ACCATGGGGT TGTTCGCAGT TGTCACTTTA CAGTTCGTCA TGCTTTGGGG GAGCAGGGCG ATTGCCAGCA CAGTCACACT TAAACCGGGC GAAAACGTGG CCGCAGCGGT CGCGAACGCT CCTGCAGGGT CAACCTTCGT ATTCACTCCG GGTACGTATC GCATGCAATC GATCATTCCG AAAGACAACG ATATCTTTGT CGGGCAATCA TCGACCGGCG TCATCCTGAA TGGCGCCAAG GTTCTAACGA TGGAACCGAA TGGCAAGTAC TGGACGAAAA TCGAGCCGCT GAATCCAACG GTTTACGTCG CAAACCATTG CAATCCGGGA CACGCGCGTT GCTACATCCT GAACGATCTG TTCATTGATG GAAAGTTGCA ACGCCCGGTG AGTTCGCTCA GCAGTCTGGC GGCGGGACAC TGGTATTACA ACCTCACCAC GGGAACGATT TACATCAGCA CCAATCCGGC TGGACACGTG GTGGAGTGGG CCTATACCAC GTATGCCTTC CGGGGAGCCG CGACCGGTGT GCAAATCAGC TTTCTCACCG TGAAGAACTA TGCGACGCCC CCGCAAGCGG GTGCGATCGG AGGGCCGAAC GGCAAGGCGG AGCATTGGTA CATCCACAAC GTGAATGTCA TGCACAATCA CGGGGCCGGG ATTGCCATCG GAAACTACAG CAAGGTGATG TACTGCAACT CGTCGAGCAA CGGCCAGGAG GGGCTTGCCG GCCACGGCGC GTATATCACG ATTGAACACA ACACCTTCGC CTACAATAAC CAGGCCAGCT ACATGAACTT CTGGGAAGCG GGAGGCGCAA AAGTCACGGA TACCAGCCAT TTGCTGCTTG GCTACAACTA CGTTCACGAC AACCTCGGCA CGGGATTGTG GGAAGACATG TACAACACTG ACTCGGTCGT CGAAAACAAC ACCAGCATTA ACAACCTGGT GGGTATTGCC GAAGAGTTCG CGTCGAACCT GACGCTCAGG AACAATGTCG TGCGCGGCAA CAGGAAGATG GGCATCCTGA TTTCGCTTTC CCGGTATGCG GAGGTCTATG GCAATACCGC GGAAGTTCCG GTGAACGGGA TTGACGCGAT CCGGGTCGCG GAAGGTCAAC GCGACGGGAT GAACACCCAC GACGTTCACG TGCACGACAA CATCATGATC TTCGACGGAA CAAAGTCGGG TCGCACTGGG CTCTCAGGAA ATCTCGATAC CGCGACCAAC GTGACTTTCA ACAACGACAA GTACTACAAG AAGAACGGTG GGTACTATCA CTGGTTGTGG GGCGGATCCA CCTGGATTTC CTTCACTGCC ATGCAGAAGG CCGGACAGGA GTTGACGGGA ACCGTTTCGA CCGGTGCGCC GTAA
|
Protein sequence | MCESRNAMGR TMGLFAVVTL QFVMLWGSRA IASTVTLKPG ENVAAAVANA PAGSTFVFTP GTYRMQSIIP KDNDIFVGQS STGVILNGAK VLTMEPNGKY WTKIEPLNPT VYVANHCNPG HARCYILNDL FIDGKLQRPV SSLSSLAAGH WYYNLTTGTI YISTNPAGHV VEWAYTTYAF RGAATGVQIS FLTVKNYATP PQAGAIGGPN GKAEHWYIHN VNVMHNHGAG IAIGNYSKVM YCNSSSNGQE GLAGHGAYIT IEHNTFAYNN QASYMNFWEA GGAKVTDTSH LLLGYNYVHD NLGTGLWEDM YNTDSVVENN TSINNLVGIA EEFASNLTLR NNVVRGNRKM GILISLSRYA EVYGNTAEVP VNGIDAIRVA EGQRDGMNTH DVHVHDNIMI FDGTKSGRTG LSGNLDTATN VTFNNDKYYK KNGGYYHWLW GGSTWISFTA MQKAGQELTG TVSTGAP
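
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced this record: the helper names are hypothetical, the sequences are assumed to be whitespace-grouped as shown above, and the codon table is the standard amino-acid mapping, which translation table 11 shares for elongation codons (table 11 differs in its set of permitted start codons, which this sketch does not handle).

```python
from itertools import product

# Codon table built in TCAG order; the amino-acid assignments are the
# standard ones, which translation table 11 also uses ('*' marks a stop).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character groups and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three and last three groups of the gene sequence shown above.
print(translate("ATGTGTGAAA GCAGAAACGC AATGGGCCGA"))  # MCESRNAMGR
print(translate("ACCGTTTCGA CCGGTGCGCC GTAA"))        # TVSTGAP*
```

The two fragments reproduce the first ten and last seven residues of the protein sequence, with `*` marking the terminal TAA stop codon. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field; that result is not asserted here.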