Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2771 |
Symbol | |
ID | 5900226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3010599 |
End bp | 3012134 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641563263 |
Product | anthranilate synthase component I |
Protein accession | YP_001684396 |
Protein GI | 167646733 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00110459 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.00000354378 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCCTGG AGCCCAGCCT AGAGGCCTTT ACCGCCGCCT ATGACGCCGG CCGCCCGCAG GTGGTCTGGA CCCGGCTGAT CGACGATCTG GAGACCCCGG TCTCGGCCTA TCTGAAGATC GGCTACGGCC GGCCCTACGC CTTCCTGTTC GAGAGCGTCG AGGGCGGGGC CTGGCGCGGG CGCTATTCCA TCGTGGCGAT GAAGCCGGAC CTCGTCTGGC GCTGTCGCGG CGACCAGGCC GAGATCGCCG AGGGCGACGA CATCGCCGCC CGCCGGTTCA CGCCCCAGGC GGGCGGCGCG CTGGACAGCC TGCGCGACCT GAGCGCCCAG TCGCGGATCG AGCTGCCGGC CGGCCTGCCG CCGCCGTCGG CCGGGGTGTT CGGGGCCCTG GGCTACGACA TGGTCCGCCT GGCCGAGCGC CTGCCTGATG TGAACGAAGA TAGTCTGGGC CTGCCGGACG GGATCATGAC CCGGCCGTCG ATCGTGGCGG TGTTCGACGC CATCGCCCAG GAGATCATCC TGGTCACCAC GGCGCGGCCC AAGGCCGGGG TCTCGGCCGA CGAGGCCTAC GCCGCCGCCC GCGCCAGGAT CGAGACCGTG ATGGTCGACC TGCGCCAACC GCTGGCCCAC GACGCGCCGC GACCCAGCCA TGGCCCGATG GACTTCACCA CCCCGGTCAG TCGCGCCGAC TACGCCACGA TCGTCGAGAA GGCCAAGGAA TACATCCGCG CCGGCGACAT CTTCCAGGTG GTGCCCAGCC ACCGCTTCCG CGCGCCCTTC CCGCTGCCGC CGTTCGCGCT CTACCGCTCT CTGCGACGGA CCAATCCCTC GCCGTTCCTC TATTTCCTGG ACCTCGACGG TTTCGAGTTG GTGGGGTCGT CGCCGGAGAT CCTGGTCCGG CTGCGCGACG GCAAGATCAC CATCCGTCCG ATCGCCGGCA CGCGCCCGCG CGGCGCCACG CCGGAGGAGG ATCTGGCGCT GGAGAGGGAG CTGCTGGCCG ATCCCAAGGA GCGCTCCGAG CACCTGATGC TGCTGGACCT GGGCCGCAAC GACGTCGGTC GGGTCGCGAT GCTCAAGCAC GCCGGCAGCA ACGATCCGGC CGCCAAGGGC AAGCACGCCA ACGTCCGGGT CACCGACAGC TTCAAGATCG AGCGCTACAG CCACGTCATG CACATTGTCT CCAATGTCGA GGGCGACGCG CCCGAGGGCG TGGACCCGGT CGACGTGCTG ATGGCCGCCC TGCCCGCCGG CACCCTGTCG GGCGCGCCCA AGGTGCGGGC CATGGAGATC ATCGACGAGC TGGAGATCGA GAAGCGCGGC GTCGGCTACG CCGGAGCGGT CGGCTATATC GGCGCCGACG GCTCAGTCGA CACCTGCATC GTGCTGCGCA CGGCCCTGGT CAAGGACGGC ATGATGTACG TCCAGGCCGG CGGCGGGATC GTCGCCGACA GCGACCCGGA CGCCGAATAT GACGAGACCC TGCACAAGTC GCGGGCGCTG AAGCGCGCGG CGGAAGAGGC CTGGCGCTTC GCTTGA
|
Protein sequence | MSLEPSLEAF TAAYDAGRPQ VVWTRLIDDL ETPVSAYLKI GYGRPYAFLF ESVEGGAWRG RYSIVAMKPD LVWRCRGDQA EIAEGDDIAA RRFTPQAGGA LDSLRDLSAQ SRIELPAGLP PPSAGVFGAL GYDMVRLAER LPDVNEDSLG LPDGIMTRPS IVAVFDAIAQ EIILVTTARP KAGVSADEAY AAARARIETV MVDLRQPLAH DAPRPSHGPM DFTTPVSRAD YATIVEKAKE YIRAGDIFQV VPSHRFRAPF PLPPFALYRS LRRTNPSPFL YFLDLDGFEL VGSSPEILVR LRDGKITIRP IAGTRPRGAT PEEDLALERE LLADPKERSE HLMLLDLGRN DVGRVAMLKH AGSNDPAAKG KHANVRVTDS FKIERYSHVM HIVSNVEGDA PEGVDPVDVL MAALPAGTLS GAPKVRAMEI IDELEIEKRG VGYAGAVGYI GADGSVDTCI VLRTALVKDG MMYVQAGGGI VADSDPDAEY DETLHKSRAL KRAAEEAWRF A
|
| |