Gene Information |
Locus tag | Acid345_4120 |
Symbol | |
ID | 4072311 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4879933 |
End bp | 4881402 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637986151 |
Product | anthranilate synthase, component I |
Protein accession | YP_593194 |
Protein GI | 94971146 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages; [TIGR00565] anthranilate synthase component I, proteobacterial subset |
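The coordinate fields in this table are related by simple arithmetic. Assuming Start bp and End bp are 1-based and inclusive (an assumption, but one that reproduces the reported 1470 bp), the gene length is End - Start + 1, and for an unspliced CDS the protein length is the codon count minus one for the stop codon. A minimal Python sketch of that check follows; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(4879933, 4881402)
print(length, protein_length(length))  # 1470 489
```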
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.622363 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTCGC CAGATTTCAA GTCCTTCTCG CAGCTAGCAC GCGAAGCCTC GCTCGTCCCC GTCACGCGCA CGATTTCGGC CGACCTCCTC ACTCCGGTTT CCGCCTTCCT TGCTCTGGCC GACAAAGAGC CTTACGCCTT CCTGCTGGAA TCCGTCGAAG GCGGCGAGCG TATTGGACGC TACACCTTTC TCGGCATCCG GCCCTACATG GTCGTCACCG GCCGCGGCAG CGAAGTTACG ATACGCCGCG GTAAGAAGAC CGAAAAGTCG TCTTCCGATC TACTTGGAAC CCTGCGGGCC GCGTTACGCG AGCACAAGCC CGCCACCGTC CCCGGATTGC CGCCCTTCAC CGCGGGCGCT GTCGGCTACT TTGCTTACGA TGCCGTGCGC CACTTCGAGC GCCTGCCCGA CATCGCCAAA GACGACCTCC ACCTTCCCGA CGGCGTCTTC ATGTTCTACG ACCGCCTGCT GGCCTTCGAT CACCTGCGCC ACCAGTTGCA CCTCATCGCC GCCGCCGACG TCCGCACCGA GAAGCCGCGC GCCGCCTACG ATCGCGCTAT CGCCGATCTC GATGCGCTGG AGAAAAAGCT CGTATCGGGA CTGAAGATTC GTCGCCTGCG TCCCGAAAAG AAAACCGCGA AGATCAAGCT GCACGCCCGC ACAAAGCCCG CCGACTACAT GAACGCCGTG AAGCGCGGCA AGGAATATAT CGCGGCAGGA GATGTCTTCC AGGTCGTGCT CTCCCAGCGC CTCGACTTCG CACTGCCCGC GCCTCCCTTC GACATCTACC GCTCTCTGCG CACGGTGAAT CCGTCGCCCT ACATGTACTT TCTGCGCATG GACGACCTCC ACGTCCTCGG CTCGTCGCCC GAGATGCTGG TGAAAGCCAA CAACCGCACG CTGGAGTACC GCCCGATCGC CGGGACCTAC AAGCGCGGCG CGACCGCCGA AGAAGATGCG CGTCTCGAAG AGCACCTTCG CACCAACGAA AAAGAGCGCG CCGAGCATGT GATGCTCGTA GATCTTGGAC GGAACGATCT CGGCCGCGTG AGCGAATACG GCTCTGTCAA AGTAAAAGGC CTGATGTACG TAGAGCGCTA CTCGCACGTG ATGCATCTCG TCTCCGCGCT CGAAGGCAAA CTGCGCGGCG ACCTCGACGC GCTCGACGCC TTCGCCGCCT GCTTCCCCGC CGGCACCCTC AGCGGCGCGC CCAAAGTCCG CGCCATGGAA ATCATCGAAG AACTGGAACC CACCCGTCGC GGCGTCTACG GAGGTTCGGT TTTGTATGCC GACTTCGCCG GCAATCTCGA CTCCTGTATC GCCATCCGCA CCATGGTCGT GAAAAACAAC CGCGCGTATG TCCAAGCCGG CGCCGGCATC GTAGCCGACA GCGATCCCGA AAGCGAATTC CAGGAGTGCC GCAACAAAGC GCAAGCGGTC GTCCGCGCCG CCGAACTGGC GGGACGATAG
|
Protein sequence | MDSPDFKSFS QLAREASLVP VTRTISADLL TPVSAFLALA DKEPYAFLLE SVEGGERIGR YTFLGIRPYM VVTGRGSEVT IRRGKKTEKS SSDLLGTLRA ALREHKPATV PGLPPFTAGA VGYFAYDAVR HFERLPDIAK DDLHLPDGVF MFYDRLLAFD HLRHQLHLIA AADVRTEKPR AAYDRAIADL DALEKKLVSG LKIRRLRPEK KTAKIKLHAR TKPADYMNAV KRGKEYIAAG DVFQVVLSQR LDFALPAPPF DIYRSLRTVN PSPYMYFLRM DDLHVLGSSP EMLVKANNRT LEYRPIAGTY KRGATAEEDA RLEEHLRTNE KERAEHVMLV DLGRNDLGRV SEYGSVKVKG LMYVERYSHV MHLVSALEGK LRGDLDALDA FAACFPAGTL SGAPKVRAME IIEELEPTRR GVYGGSVLYA DFAGNLDSCI AIRTMVVKNN RAYVQAGAGI VADSDPESEF QECRNKAQAV VRAAELAGR
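The protein sequence corresponds to the gene sequence translated under translation table 11, which assigns the same amino acids to codons as the standard code and differs in the permitted start codons. The following is a minimal Python sketch, assuming the space-separated 10-base blocks only need their whitespace removed before use. The helper names `clean`, `translate` and `gc_percent` are illustrative, and only the first 30 bases of the gene sequence are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove whitespace from a blocked sequence and upper-case it."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGGACTCGC CAGATTTCAA GTCCTTCTCG"
print(translate(first_blocks))  # MDSPDFKSFS
```

This sketch does not handle the alternative start codons that table 11 allows (for example, an initial GTG or TTG read as methionine). That does not matter for this gene, which begins with ATG.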