Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3820 |
Symbol | |
ID | 4071104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4516295 |
End bp | 4517326 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637985843 |
Product | carbamoyl phosphate synthase-like protein |
Protein accession | YP_592894 |
Protein GI | 94970846 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.937743 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.217924 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGATC GCGATCGGGA GCACTCGAAA AGAAGGTCAG GTTCTTTCGT GAAACAACCA GCAGTCTTAA TTACGAGCGC AGGGCGACGC ACGTCGCTCC TTGGCTCGTT CGTGAAAGCC GCGCAGGGGC GCGGATGGCG TGTATTGGCG GGCGACCGGG ATCCGCTCGC GCCAACTCTA TACTTAGCAG ATGAGGGACT GAAACTCCCA TCTTTGTCGG AGCACGACTA TATTCCGTGC CTTCTTGAAC TGGTTCGTGA ACGCTCCATC CGGATGATCG TTCCCACTAT TGATACTGAA TTGGGATTGC TCGCCCGCCA CGCCGACGAC TTTCGAAAGG AAGGATGTGT TGCGGTGATT TCGTCGGGAC GGCTCATTGA AATCGCGGGT GATAAGTGGC TCACCATGCA GGAATACGCT GCGCGAGCCG TGCGAACGCC AAAATCCTGG CTTCCCGATG CCCTTCCATC GCATGGCTTA CCGGAGAGTC TTTTTGTAAA GCCAAGAGAC GGCAGTGCCA GCCAACACGC GTATCGCGTG CAAAGCTCCG AATTGGCGAG AAAGCTGCCG GAAGTCCCCA ACGCAATTGT TCAGGAAGAG CTTCGGGGCA ACGAGATTAC GATCGACGCG TTGATCGATT TGAATGGGCA GCCTCTTCAT TACGTCCCTC GAGTTCGCAT ACGTACTCTT GGGGGAGAAT CCATTCAAGG AGTCACCATC GCTGGCGATG AGATTCGCGA TTGGATCATT CACTGTTTGA ACGTCACCGC GGAACTCGGT GGCGTCGGGC CAATCACGAT GCAAGCGTTT CTCACTCCCG ATGGCCCAGT GCTCTCGGAG GTCAATCCGC GATTCGGTGG TGGATTTCCG CTCACACTTG CGGCTGGGGG AGCCTACCCC GATTGGCTGA TTGCAATGGT AGAGGGTGAG AAAATTGATC CCCGGTTTGG AGAGTATCGC CGCGGGCTGT ATATGACTCG CTACTATGTT GAGTTTTTCA CTGACAAACC GTTGTGGGAG GCGATGCAGT GA
|
Protein sequence | MPDRDREHSK RRSGSFVKQP AVLITSAGRR TSLLGSFVKA AQGRGWRVLA GDRDPLAPTL YLADEGLKLP SLSEHDYIPC LLELVRERSI RMIVPTIDTE LGLLARHADD FRKEGCVAVI SSGRLIEIAG DKWLTMQEYA ARAVRTPKSW LPDALPSHGL PESLFVKPRD GSASQHAYRV QSSELARKLP EVPNAIVQEE LRGNEITIDA LIDLNGQPLH YVPRVRIRTL GGESIQGVTI AGDEIRDWII HCLNVTAELG GVGPITMQAF LTPDGPVLSE VNPRFGGGFP LTLAAGGAYP DWLIAMVEGE KIDPRFGEYR RGLYMTRYYV EFFTDKPLWE AMQ
|
| |