Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3360 |
Symbol | |
ID | 4071278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3986391 |
End bp | 3987770 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637985382 |
Product | hypothetical protein |
Protein accession | YP_592435 |
Protein GI | 94970387 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGCAGC GTCGGAAGGG TTTGTTGATC GCGCTCTTGG CAGTTGTTTG CAGCTTCGTG CTGTTGCTGC CTGCCTTGGC CGAATCCAAT GTCCGCATCG TTCGCCTGAG CTACATTGAT GGCGACGCCC AAATCAACAC GACAAACCAG GATGACGGAT TCACTCATGC CGTTCTCAAT ACGCCGGTAA CAGCGGGCAT GTGGATCTAC ACGCCGAACA ACTCGCACGC TGAAATCCAG TTTGAAAATG GCAGCACCGT GCGAATGGTG GATGACGCGC AAATCCAGTT CGAAAAACTC GCGCTTGCCG ATTCCGGCGG AAAGATCAAC ATCATTAATG TTGATCACGG CGTGGTGTAC TTCAACTTCT CGAAGGTAGG CAAAGACGAC AACATCATCG TCAAGGCCGG CGCCAAGACC ATCCACGTTG CAAAGTCCTC GCACTTCCGC GTCGACGCCA GCGACAAGAA TGTTCTCGTC TCCGTGTTCA AGGGCGACGC CATGGTTGAC GGCGACCAGT CGATCGAGAT TAAGAACAAC GAATCCGTGA ACCTTGCCGC CGAAGACGCG AAGGTTGGTA GAGGCGTCGA CGAGCTGGGC AGCGACACGT GGGACAAGCA TCGTGACGGA GAGGTTGCGG CGCTGAGCAT GAAGGCCGCC CCGGTTGGTT ATGGCGATGC GTATAGCTCA CAGTTCGGAT ACCTTGGTTC TTACGGCAAT TACACCAACG TCCCGGGCTT CGGTTGGGGA TGGCAGCCCT ACGGCATGGG AATGGGTTGG GACCCGTTCA TGAACGGCGT CTGGAACTAC AACCCAGGCT TAGGCTACAT GTGGGTTTCG TCGTATCCGT GGGGATGGGG ACCGTATCGC TACGGTGCCT GGAACTACGT TCCGGCCTAC GGCTGGATGT GGATGCCCGG CTCGAGTTTC AATTCCTGGA ACGTGGGTCC AGCATACGGA GCTGTGCCGG CTAACTGGCA CGCACCAACC GTTCCGGTTG TCGGCAAGAC TCCGGTGAAG ACGGTTGTGG TGGGCAATCC ACCGAACGTG CACCCGGCAA TTCTCGCGGG ACATCCTGAA GGTGGATCGC ATGCCGCAGT CTCGACGCGC GCGAAAGCTT CGAACAATGT CCGCGTGAAG CCGCCTGTGG CGACTGCGAC GTCCGGCGCC AAGCCGACGT CGAATACAAC CGCCACGAAG ACTGGCACGA GCACTGGCGC GAAGAGCGGT GCGCAGCCTG CGCATGCCGG CGGCGCACAG CACGCGAGCG GGGGACAACC CTCTGGTCAA CACATGGGCG GACCGCCAAC GGGTGGTGGT CAACGCATGG GTGGCGGAGC ACCGGCCGGA GGCCATCCGC CTGCGACTCG TCCTCACTAA
|
Protein sequence | MLQRRKGLLI ALLAVVCSFV LLLPALAESN VRIVRLSYID GDAQINTTNQ DDGFTHAVLN TPVTAGMWIY TPNNSHAEIQ FENGSTVRMV DDAQIQFEKL ALADSGGKIN IINVDHGVVY FNFSKVGKDD NIIVKAGAKT IHVAKSSHFR VDASDKNVLV SVFKGDAMVD GDQSIEIKNN ESVNLAAEDA KVGRGVDELG SDTWDKHRDG EVAALSMKAA PVGYGDAYSS QFGYLGSYGN YTNVPGFGWG WQPYGMGMGW DPFMNGVWNY NPGLGYMWVS SYPWGWGPYR YGAWNYVPAY GWMWMPGSSF NSWNVGPAYG AVPANWHAPT VPVVGKTPVK TVVVGNPPNV HPAILAGHPE GGSHAAVSTR AKASNNVRVK PPVATATSGA KPTSNTTATK TGTSTGAKSG AQPAHAGGAQ HASGGQPSGQ HMGGPPTGGG QRMGGGAPAG GHPPATRPH
|
| |