Gene Information |
Locus tag | TM1040_1143 |
Symbol | |
ID | 4078439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1229411 |
End bp | 1230922 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638006447 |
Product | anthranilate synthase component I |
Protein accession | YP_613138 |
Protein GI | 99080984 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0793229 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCTGA TCCCCGATTT CGACAGTTTC GCCAAAGCCT ATGAGGCGGG CGAAAACCAG GTGGTCTACA CACGGCTTGC CGCCGATCTG GATACGCCCG TCTCCTTGAT GCTGAAGCTC ACCGGCGCGC AGAAGGATGC CTTCATGCTG GAATCGGTGA CCGGCGGCGA GGTCCGCGGG CGCTATTCCA TCATCGGCAT GAAGCCTGAC CTGATCTGGC GTGCGCGGGG CGAACAGGCC GAAATCAACC GCGCCGCGCG GTTCGATCCC GAGGGCTTTG CCCCGCTCGA CGGCAACCCA CTTGATACAC TGCGCGCGCT ACTGGCCGAG AGCCGCATCG ACCTGCCGGA TGATCTGCCA CAGGCGGCGG CGGGGCTGTT TGGCTATCTG GGCTATGACA TGATCCGCCA TGTGGAGCAT CTGCCGGATG TGAACCCCGA CCCGCTTGGC CTGCCGGACG CGGTGATGAT CCGCCCCTCC GTGGTGGCCG TGCTGGACGG TGTCAAAGGC GAGGTCACAG TGGTCTCTCC CGCCTGGGTC AGCGAAGGCC AGTCGGCGCG CGCGGCCTAT GCTCAGGCTG CCGAACGCGT GATGGATGCG GTGCGTGATC TTGAACGTGC CATGCCCGCC GAGACCCGCG ATCTGGGCGA GGCGCGCGAG GTCGCGCCCC CGGTCTCCAA CTTCACCAAG GACGGCTACA TGGCCGCCGT GGAGAAGGCC AAGGACTACA TCCGTGCCGG CGACATCTTT CAGGTAGTGC CCGCACAGCG CTGGACGCAG GAGTTCCCGC AGCCGCCCTT CGCGCTCTAT CGTTCGCTGC GACGCACCAA CCCCTCGCCG TTCATGTTCT ACTTCAACTT CGGCGGTTTT CAGGTGATCG GCGCCAGCCC CGAGATCCTC GTTCGGGTCT TTGGCAACGA GGTCACCATT CGCCCCATTG CTGGCACCCG TCCGCGCGGC GCAACCCCCG AAGAAGACAA AGCGCTGGAA CAGGATCTGC TTGCCGACAA GAAAGAGTTG GCCGAGCACC TGATGCTCTT GGATCTGGGC CGTAACGACG TGGGCCGCGT TGCCAAGATC GGCACCGTGA AACCCACCGA GGAATTCATC ATCGAGCGCT ACAGCCACGT GATGCATATC GTTTCGAATG TTGTTGGCGA ACTCCACGAG GACAAAGACG CGCTCGATGC ATTTTTTGCA GGCATGCCTG CGGGTACGGT TTCCGGAGCG CCCAAGGTGC GTGCGATGGA GATCATCGAC GAGCTCGAAC CCGAAAAGCG CGGCATCTAT GGCGGTGGCG TCGGCTATTT CAGCGCTGGC GGCGACATGG ACATGTGTAT CGCGCTGCGC ACAGCCATCG TGAAGGATCA GAACCTCTAT ATTCAGGCCG GGGGCGGCGT CGTCTATGAC AGCGACCCGG AGGCCGAATA TATGGAGACC GTGCATAAAT CGAACGCGAT CCGCCGTGCG GCTGCAGATG CGGCGCGCTT TACCGGCAAC GGCAACCGCT GA
|
Protein sequence | MALIPDFDSF AKAYEAGENQ VVYTRLAADL DTPVSLMLKL TGAQKDAFML ESVTGGEVRG RYSIIGMKPD LIWRARGEQA EINRAARFDP EGFAPLDGNP LDTLRALLAE SRIDLPDDLP QAAAGLFGYL GYDMIRHVEH LPDVNPDPLG LPDAVMIRPS VVAVLDGVKG EVTVVSPAWV SEGQSARAAY AQAAERVMDA VRDLERAMPA ETRDLGEARE VAPPVSNFTK DGYMAAVEKA KDYIRAGDIF QVVPAQRWTQ EFPQPPFALY RSLRRTNPSP FMFYFNFGGF QVIGASPEIL VRVFGNEVTI RPIAGTRPRG ATPEEDKALE QDLLADKKEL AEHLMLLDLG RNDVGRVAKI GTVKPTEEFI IERYSHVMHI VSNVVGELHE DKDALDAFFA GMPAGTVSGA PKVRAMEIID ELEPEKRGIY GGGVGYFSAG GDMDMCIALR TAIVKDQNLY IQAGGGVVYD SDPEAEYMET VHKSNAIRRA AADAARFTGN GNR
|
| |
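The coordinate, length, and GC fields in the record above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS is complete and ends in a single stop codon. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp):
    """Residues encoded by a complete, unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and contributes no residue.
    return cds_length_bp // 3 - 1


def gc_percent(sequence):
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Spaces and other characters (such as the 10-base grouping used in the
    record) are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    length = gene_length_bp(1229411, 1230922)
    print(length, protein_length_aa(length))  # prints "1512 503"
```

With the values from this record, the coordinates give 1512 bp and 503 aa, matching the Gene Length and Protein Length fields. Passing the full Gene sequence string to `gc_percent` would give a figure to compare against the reported GC content.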