Gene Information |
Locus tag | Acid345_2311 |
Symbol | |
ID | 4071465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2738896 |
End bp | 2740326 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637984327 |
Product | hypothetical protein |
Protein accession | YP_591386 |
Protein GI | 94969338 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.466127 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.419829 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCAA GAATATTGGT GGGCATCATG CTGTCGCTCG CAGCGGCGTG CCTGGCAATC TTGATCGCAT GCGGTGGCAG CAGCAGCATG AACTCCAACA AGACGACGGG GACAGTCAAC CTCTCAGTCA GTGATCCGCC CACGTGTGCG GCCCCAGCCG GCCCTTATTC CAACGTCTGG GTGACGATCA AAGACGTGCA AATTCACCAG AGCGCGTCCG CGGGACCAAG TGACGCGGGC TGGGTGGATC TCACGCCGAA CCTCAAATCA GCACCGCAGC AGGTGGATCT GCTCGGGATC GCCGGCAACA ATTGTTTCCT CGCGATGCTT GGTTCAAACG TCGAACTGCA AGCCGGGAGC TACCAGCAAA TTCGAATCTA TCTCTCCGAC AGTTCCGACG CCAGCAAACT CACGACGAAC CATTGCAGTG GATCCGACGT GAATTGCGTT GTCACTGGCG GAAACACTTT CACTCTTGAG CTCTCCAGCG AATCGAATAC CGGCATCAAG ATTCCATCCG GACAACTGGC AGGCGGCAAC TTTACGATTG CGGCCGGAGA AGTGAAGGAC CTCAACATCG ACTTCGACGC CTGTCTCTCG ATCGTGCATC AAGGCAATGG TAAATATCGG CTTAAGCCCG TGCTGCATGC CGGAGAAGTT CAACTGACAT CCTCCTCGGT TACGGGTTCG CTTGTAGATA GCATCTCGCA TACGTCCATC GTTGGTGGTG CGGCGGTGGT TGGGCTGGAG CAGAAGGACG CGAACGGGAT CGACCGCGTC ATCATGCAGA CGGTTACGGA TGCCCGCGGC AACTTCGTTT TTTGCCCCGT GCCAGCCGGA ACGTACGACG TTGTAGCCGT GGCAGTAAAT GGAGCCGGAG TGGCCTACGC TGCCACGATC ACGACTGGCG TCCAACCCGG GAATGCTTTA GGAAATGTTC CGATGGTGGC TCAGGTAGGA GTTCCACTCA CCAACGCGGA AATTGATGGG GAAATTACTT CGAGCACGGG GAGCGCAGCG GCAGCGGCGG ATATAACGTT CTTCGCGATG CAATCGGTCT CTATCGAGGG CTCGACGGTG AACGTGATTA TTCCACTGGC ACAGCAATGG AGCTCGGCAA CCGCGTCCAT GACCACGGAT CCGACGTCTG CGTGCGCGAC GGCGACGGCC GCATGCGTCG CCTACCAGGT GTTCTTGCCG GCGATGTGGC CGAATGTTGG TGCATACGCT GCCTCCGGCG CGACTTACAC CCAGAACAGC GCGACGCCAG TAACGTACGC GATCGGCGCC GATGCGTTTA TTCCGGGATC GGCAGGAACG TCCGACTGCA CGCCGCCGGG TGAGATCACG ACCACCGGCG GTACGCCGAT GACCGTGTCG CCAGGATCGC CGACCCCCGC CCCGACATTG GCGTTCACCG GGTGCCAGTA G
|
Protein sequence | MKPRILVGIM LSLAAACLAI LIACGGSSSM NSNKTTGTVN LSVSDPPTCA APAGPYSNVW VTIKDVQIHQ SASAGPSDAG WVDLTPNLKS APQQVDLLGI AGNNCFLAML GSNVELQAGS YQQIRIYLSD SSDASKLTTN HCSGSDVNCV VTGGNTFTLE LSSESNTGIK IPSGQLAGGN FTIAAGEVKD LNIDFDACLS IVHQGNGKYR LKPVLHAGEV QLTSSSVTGS LVDSISHTSI VGGAAVVGLE QKDANGIDRV IMQTVTDARG NFVFCPVPAG TYDVVAVAVN GAGVAYAATI TTGVQPGNAL GNVPMVAQVG VPLTNAEIDG EITSSTGSAA AAADITFFAM QSVSIEGSTV NVIIPLAQQW SSATASMTTD PTSACATATA ACVAYQVFLP AMWPNVGAYA ASGATYTQNS ATPVTYAIGA DAFIPGSAGT SDCTPPGEIT TTGGTPMTVS PGSPTPAPTL AFTGCQ
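The Start bp, End bp, Gene Length and Protein Length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive (an assumption, the record does not state its convention) and that the last codon of the CDS is a stop codon, which matches the gene sequence above ending in TAG. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 2738896
END_BP = 2740326


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1431
print(protein_length(length_bp))  # 476
```

Under these assumptions both results agree with the listed Gene Length (1431 bp) and Protein Length (476 aa).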