Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0860 |
Symbol | |
ID | 4068954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1071545 |
End bp | 1072828 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637982869 |
Product | hypothetical protein |
Protein accession | YP_589939 |
Protein GI | 94967891 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCAATG CCCTTGCGCT CGCCGCGGTC ACGGCTGTGC TCCAGTCGTA TCTGAACGCT GTGTACAACA ATCCGTCATC GGTCTTGGGC AGCGTCTCCG TGACCGCTAT TGCTCCTGAC CTCATTCAGG GCGGTATTGC CGGCGGCGGC AACGCGCCTC TCCAGGTAAA TATCTTTCTC CACCAAGTCA CGCTAAACGC CGCGTGGCGA AACATTGAGA TGCCAACCCT TGCGCCGGAC GGTCAAACCC GCATTGCGAA TCAACCCCTC GCGCTGGACC TCCACTATCT TCTGACCGCG TATGCGCCCG AAGATAGCCA GGCCGAAGCC TTGCTTGGTC TTGGCGTCTT CTTCTTGCAC CAAAATCCGA TGATTGCCCG CGCAGATATC GCTTCGGCGC TAGCAGCCCT TCCACCGAGC TATCCAGCTC CATTCGCTAC CGCGCTCGGT CTCTCGGGAC TTGCCGACCA GGTCGAAATG ATCAAGATCA CTCCCGCCAC TCTTGGTCGC GAGGAGATCG CGTGGCTCTG GACCGCCCTC AAGGCCGACT ACCGCCCGAC GTTTCCCTTT CAGGTATCCG TGGTCCTGAT CCAGCCGCAG AATCCAGTAT TCGCCGCTTT ACCCGTACTA CAACGGATTA TCGAAGCGAA GCCGCTGTCT CCAATTCCAA CGTTGACCGA AGCTGATCCG CCAAACAAAC AGCCTGTCGC ATGTCTCGGA GATACGGTCA CCGTTCAAGG CGCATTCCTG AACGGAACCT CCGCCGTACG GTTGGTCAAT CCACAGCAGG GTCTTCAGTC GGATATCACC GCCATTACGA ATGCCACGAA TGTGTCTTTT AAGTTTGGTA TTCCTAACCC CGTGCTACCG TCCCCACAAC TTCATCCCAC GGACCTCCCC GCAGGCGTTT ACGTGGTCTC CGCCAAGGTC GCATCGGATG GCGACACAGT GGACACCAAT GGCGTTGCCC TCGCGATTGC GCCGAAAATC GATGCGTCTT GGGCGCCCGG AACGATCCCA TCAGGTCTAA ACGTTTCCGT CTCCGTGCCA TGCGCACCCT ATCTCCGCCC TGGGCAGGCT GTTCAACTCC TTATCGGAAG CCAGGCGGCT CCAGCCGACA CCTTCGATAC TCCAACCAAT TCTCCGAGCT TCACCTTTGC CAACCTCACC GCCACTGCCA CACCCGTTCC AGTGCGGCTC CGCGTCGACG GCATCGACAG TCCAATCATC GACATGACGG CGAAGCCTCC GAAATTTACC GGCCCGTCCG TGCAGGTGAC GTAA
|
Protein sequence | MSNALALAAV TAVLQSYLNA VYNNPSSVLG SVSVTAIAPD LIQGGIAGGG NAPLQVNIFL HQVTLNAAWR NIEMPTLAPD GQTRIANQPL ALDLHYLLTA YAPEDSQAEA LLGLGVFFLH QNPMIARADI ASALAALPPS YPAPFATALG LSGLADQVEM IKITPATLGR EEIAWLWTAL KADYRPTFPF QVSVVLIQPQ NPVFAALPVL QRIIEAKPLS PIPTLTEADP PNKQPVACLG DTVTVQGAFL NGTSAVRLVN PQQGLQSDIT AITNATNVSF KFGIPNPVLP SPQLHPTDLP AGVYVVSAKV ASDGDTVDTN GVALAIAPKI DASWAPGTIP SGLNVSVSVP CAPYLRPGQA VQLLIGSQAA PADTFDTPTN SPSFTFANLT ATATPVPVRL RVDGIDSPII DMTAKPPKFT GPSVQVT
|
| |