Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0320 |
Symbol | |
ID | 4068597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 346156 |
End bp | 347979 |
Gene Length | 1824 bp |
Protein Length | 607 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637982323 |
Product | hypothetical protein |
Protein accession | YP_589399 |
Protein GI | 94967351 |
COG category | [S] Function unknown |
COG ID | [COG3533] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.280201 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.603461 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGATC GTCGGGATTT TCTAAAGACT GCTGGAATTG CGGCAGCCGG CTCCTATGCG AGCCGGCTGT CCGCTGCCCA AGCAGGACCA TCGCAAGCGC ATCGGTTCAC GACCTTCAAC TACGGGGATG TTCAGCTACT CGATGGTCCG CTGAAGAAAC AGTTTGACGA GAACCATGCG TTCTTTCTCA AGCTCGATGA AGACCGCCTG CTGAAAGTCT TCCGCCAGAA AGCCGGACTG CCCGCGCCCG GCGAGGACAT GGGTGGCTGG TACGACCTCA CGGGCTTCGA TCTCGCCAAG GGCGACTTCC ACGGCTTCGT TCCCGGGCAC ACCTTGGGCC AATATGTTTC GGCGTTGGCC CGATGCTATG CCGCCACAGG ATCGGAAGAG ACCAAGGCGA AGGTTCATCG ACTGGTGAAG GGTTACGGCG CCACGCTCGA CGACAAAGCT TCATTCTTCG CCGGCTATCG CCTACCGGCC TACACCTACG ACAAGCTCTC GTGCGGGCTA ATCGACGCGC ACGAGTTCGC ACACGATCCC GACGCGATGG CGATTCACGA AAAGCTGACG CGCGGCATGT TGCAATATCT TCCTGAAAAA GCATTGTCGC GAGCGGAGCA GCGGGCACGG CCCCACAAAG ATGAGTCGTT CACGTGGGAC GAGAGCTACA CGCTGCCGGA GAATCTGTTC CTCGCCTATC GCCGGACGGG CAACAAGTTC TATCGCGAGC TCGGAACTCG TTTCCTGGAA GACGATACCT ATTTCAATCC GCTCTCGGAG GGTATCAACG TGCTCGCGGG TGAGCACGCC TATAGCCACA TGAATGCCTT CTGTTCGGCG ATGCAGGCCT ACCTCACGCT CGACAGCGAA CGGCACCGCA AAGCGGCGCG GAATGGCTTT CGCATGGTCG CCGAACAAAG CTTTGCCACT GGCGGATGGG GACCGAGCGA GGCATTTGTA GAGTTCAACA AAGGCCAGCT TGGCGACAGC CTGGAAAAGT CGCACTCGAG CTTCGAGACT CCTTGCGGCG CGTACGCACA TTTCAAACTG ACACGATATC TTCTCCAAAC CGACGGCGAC TCCACTTACG GCGACAGCAT GGAGCGCGTG ATGTACAACA CTGTGCTCGG CGCCAAACCG ATCCAGCCGG ACGGAACTAG CTTCTATTAT TCGGATTACG CTACCGTTGG CAAAAAGGTC TATCACAACG ACAAGTGGCC GTGCTGCTCA GGCACGCTTC CGCAAGTCGC AGCGGATTAC CACATCAGCA TCTATCTCAA AGCGACAGAC GGCGTTTGTG TAAATCTATT TGTTCCTTCG ACGCTCATCT GGAAGGCCAG CGATGGGAGT TGCAAGCTCA CGCAGGAGAC GAAGTACCCG TTCGAGACTT CCGTCGCAAT GCGATTCGCT ACCACGCAGC CTGTCGAACA AACTCTGTAC ATCAGGATTC CGGCGTGGGT CACCAGCGAA CCTGCACTTC GCGTGAATGG CCAACGCACA GACGTTGCGG CGAAACCGGG AGCATTCGCG GCAATCCGGC GTACTTGGAA GGACGGCGAT CGCATTGATC TCGACCTGCC TATGGGCTTC GAGTTGCAAC CCGTCGATGG GCAACACGAG AAACTCGTAG CCCTTGTTCA CGGCCCCTTA GTGTTGTTCG CAATCGGCGA TTCGCGGCCG CGTTTTCATC GCTCCGACCT GCTCGACGCC AAGCCATCCG CCAACAACGA CTGGAGCGTT CGCGCCGCCG GTGGCAAGCA AGTTGTTTTT AGATCGTTCC TAAAGATTCA GGATGAGAGT TACAGCACGT ACGTTGAGAT CTAG
|
Protein sequence | MLDRRDFLKT AGIAAAGSYA SRLSAAQAGP SQAHRFTTFN YGDVQLLDGP LKKQFDENHA FFLKLDEDRL LKVFRQKAGL PAPGEDMGGW YDLTGFDLAK GDFHGFVPGH TLGQYVSALA RCYAATGSEE TKAKVHRLVK GYGATLDDKA SFFAGYRLPA YTYDKLSCGL IDAHEFAHDP DAMAIHEKLT RGMLQYLPEK ALSRAEQRAR PHKDESFTWD ESYTLPENLF LAYRRTGNKF YRELGTRFLE DDTYFNPLSE GINVLAGEHA YSHMNAFCSA MQAYLTLDSE RHRKAARNGF RMVAEQSFAT GGWGPSEAFV EFNKGQLGDS LEKSHSSFET PCGAYAHFKL TRYLLQTDGD STYGDSMERV MYNTVLGAKP IQPDGTSFYY SDYATVGKKV YHNDKWPCCS GTLPQVAADY HISIYLKATD GVCVNLFVPS TLIWKASDGS CKLTQETKYP FETSVAMRFA TTQPVEQTLY IRIPAWVTSE PALRVNGQRT DVAAKPGAFA AIRRTWKDGD RIDLDLPMGF ELQPVDGQHE KLVALVHGPL VLFAIGDSRP RFHRSDLLDA KPSANNDWSV RAAGGKQVVF RSFLKIQDES YSTYVEI
|
| |