Gene Information |
Locus tag | Acid345_4542 |
Symbol | |
ID | 4070221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 5386154 |
End bp | 5387419 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637986582 |
Product | hypothetical protein |
Protein accession | YP_593616 |
Protein GI | 94971568 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
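The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the helper names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 5386154
END_BP = 5387419


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1266 421
```

Under these assumptions the result agrees with the Gene Length (1266 bp) and Protein Length (421 aa) fields.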
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00650799 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.54265 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGCGG GCCTTTTTGC GCGGGAATTG GGCCCAGCGC AGAAGATCCT CGCGACGGCG GCTCTCGCCT TCGCGGCGTT CGGCTGCGCT CTGCACCCGT ACGTTTATCA CGACTTCCTC ATCGTGCCGT ACTTCGCGGT GGGGCTTGCC TGCATTCTTA TTCTTCAACT GAGAGTCATG CCCTCGGTAC GCGATGCTAT CGCGGTCGTC GTGCTTGGAT TGGCGCTGCT GCAGGTGGAC CTGCGGCTGC TTGGGTACGC GACATCTGCG ATGGCGGTGT TGTCGTTGTT CGGGTTGGCG AGTTTGCTGG TGCTGGGATG GCGTGCGATT TGGGGCAAAG CGAAAGCAGA CGCGTTGCAC CGGGCTTTTG TGGCCGCAAT AGGGTTGGGC GTTTGCGTGG CATTTACGGG CCTCTATATC GAGCGCAGTG CCTTCTGGCA GACGAAGATG TACGACCTGT TTTTGTATTC CTTTGACGCG AGCCTGGGAG GACAGTGGAC GTTCCGGCTG GCGCAGTTTA CGGCGCACCA TCCAGGGGCT CATTTTGTGT CGGCGATGGT CTACAACGTG GTGCTCGTGC CACCGGCACT GGTGTATGCG GCGCTATTGA ATGACGAGCG TCGTGCGCGG ACTGCGCTCT GGGCGTTTCT GATTGTGGGC CCGCTGGCGT GCGTGTGTTT CCTGCTCTTT CCAGCGACGG GGCCGGTGTA TGCGTTCAAG ACCTTTCCGA TGTTGGCCGT TCCTGCTGGC GAGATCGCAC GACTGGTTCC AGGGCCGGTC GGGATCAGCG GGCCGAGAAA TGCGATTCCA TCGTTGCACT TTGCGTGGGT ACTGCTGGCG TATTGGAACT CGCGAGACAC GAAGGCAGCG ATTCGTGTTT TTTGTGCAGT GATGCTCGCG CTGACGATCT ACGCGACGTT GGAGACGGGT GAGCACTACG GCGTGGATCT TCTGGTGGCG GTGCCGTTCG CGCTGGGGAT CCAGGCGTTG GCGATGTGGC TGGGTGGGAT TCGAAGCCGG TGCGTCACGC AGGCGATCTT TGTGCCGCTA GGGATCACCG TTGCATGGTT CGTGTTGCTG AGGTTCTGCA ACCGCGTTTG TTGGGTTTCT GCGGTTGTGC CATGGGCAGC GGTGCTGCTA ACGCTTGGAG CATGCCTGTA TCTGTATCGG CGGCTGGTGG CCGTGCAGAA GGAATCCGGC TCTATCGAAA AGCAGAGCGT GTCGCGAGAG ACGACGGATT TGGTGCACGC AGGATCTGCG GGCTAA
|
Protein sequence | MSAGLFAREL GPAQKILATA ALAFAAFGCA LHPYVYHDFL IVPYFAVGLA CILILQLRVM PSVRDAIAVV VLGLALLQVD LRLLGYATSA MAVLSLFGLA SLLVLGWRAI WGKAKADALH RAFVAAIGLG VCVAFTGLYI ERSAFWQTKM YDLFLYSFDA SLGGQWTFRL AQFTAHHPGA HFVSAMVYNV VLVPPALVYA ALLNDERRAR TALWAFLIVG PLACVCFLLF PATGPVYAFK TFPMLAVPAG EIARLVPGPV GISGPRNAIP SLHFAWVLLA YWNSRDTKAA IRVFCAVMLA LTIYATLETG EHYGVDLLVA VPFALGIQAL AMWLGGIRSR CVTQAIFVPL GITVAWFVLL RFCNRVCWVS AVVPWAAVLL TLGACLYLYR RLVAVQKESG SIEKQSVSRE TTDLVHAGSA G
|
| |
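The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with GTG acting as an alternative start codon under translation table 11. The following is a minimal sketch of that translation step, not the database's own pipeline. The function name and the START_CODONS set (an illustrative subset of the start codons allowed by table 11) are assumptions; the codon-to-amino-acid assignments are those of the standard genetic code, which table 11 shares.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Illustrative subset of table 11 start codons (assumption for this sketch).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, first_codon_is_start: bool = True) -> str:
    """Translate a CDS given 5'->3' on the coding strand; stop at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        # An alternative start codon in first position is read as methionine.
        if i == 0 and first_codon_is_start and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("GTGAGCGCGG GCCTTTTTGC GCGGGAATTG"))  # MSAGLFAREL
```

The printed residues match the first ten residues of the protein sequence listed in the record.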