Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ppha_1919 |
Symbol | |
ID | 6463197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pelodictyon phaeoclathratiforme BU-1 |
Kingdom | Bacteria |
Replicon accession | NC_011060 |
Strand | + |
Start bp | 2011098 |
End bp | 2012225 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642728128 |
Product | homocitrate synthase |
Protein accession | YP_002018758 |
Protein GI | 194336964 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR02660] homocitrate synthase NifV |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00307593 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAGGC CCTGGATTAT CGACACAACA CTTAGAGATG GCGAACAGGC TCCCGGTGTT GTCTTCAGCA ATACGGAAAA AATTGACATT GCCTCTCTGC TTGCTGATGC GGGAGTCAAT GAACTTGAAG TCGGATACCC AGCCATCAGT CATGATGAAC GCAAGAGCAT ACAGAAAATC GTTACACTGA ACCTGCCGGT ACGCCTGACC AGTTGGGCTC GGGCAAAATG GCAGGATATT GAGGATGCCT GTACCTGCGG GACCGAAGCG GTTCACATCA GCTTTCCTGT TTCGGCGATG TATCTGGAGC TTATGGGAAG GGATTATGTA TGGGTACAGC AACAGTTGCA GGAACTTGTT CCACGGGCAA AAAAATATTT TAGTTATGTG AGCGTTGGGG CACAGGATGC CACAAGGGCA GATGCTGAAC TGCTCGAAAG CTTTGTGCTT GATGCTGAAG CATGTGGCGC GGACAGGGTA AGAGTTGCCG ATACGGTCGG TATCGCAACA CCAACATCCC TGATCGGGCT GATACACCGC CTGAAATCAG TAAGCCATGC GGCGCTTGAA TTTCATGCCC ATAATGATCT CGGTATGGCA ACAGCCAATG CTTATACTGC CCTTGAAGCG GGATGCCAGG CGGTCAGTGT TTCCGTTACC GGACTTGGTG AGCGAGCGGG GAATGCTGCG CTCGAAGAGC TTGCCATAGC CCTGAAGCTC TCAGGAAAAT ATGAAACAAG CATTGATACC AGAAAGCTCT CATCGCTCTG CACAGCAGTA AGCAAAGCAT CCGGAAGAGA GATCGTAGAT CAAAAACCCG TTGTAGGCAA ATCGGCATTC CAGCATGAAT CAGGTATTCA CTGTGCAGCG TTGCTGAAAC ATCCGCTCTC CTACCAACCA TTTCTTCCAG ATCAGGTTGG CCGTAACGGG CATGAGATAG TGATCGGAAA ACATTCGGGA AGCGCGTCGC TGCAGCACTA TTTTTCAGGC AAGGGCATCA CCATGACAAG AGGCGAAGCA AACCATCTGC TTGCTCTTGT TCGTACAACC GCCACTGAAA AAAAACGGGC ATTGCAGCCC GATGAGCTTG AACACATCTA CAACACACTC TTGGTGAAAC ACGGATAA
|
Protein sequence | MIRPWIIDTT LRDGEQAPGV VFSNTEKIDI ASLLADAGVN ELEVGYPAIS HDERKSIQKI VTLNLPVRLT SWARAKWQDI EDACTCGTEA VHISFPVSAM YLELMGRDYV WVQQQLQELV PRAKKYFSYV SVGAQDATRA DAELLESFVL DAEACGADRV RVADTVGIAT PTSLIGLIHR LKSVSHAALE FHAHNDLGMA TANAYTALEA GCQAVSVSVT GLGERAGNAA LEELAIALKL SGKYETSIDT RKLSSLCTAV SKASGREIVD QKPVVGKSAF QHESGIHCAA LLKHPLSYQP FLPDQVGRNG HEIVIGKHSG SASLQHYFSG KGITMTRGEA NHLLALVRTT ATEKKRALQP DELEHIYNTL LVKHG
|
| |