Gene Dfer_3790 details


Gene Information

Locus tag: Dfer_3790
Symbol:
ID: 8227379
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dyadobacter fermentans DSM 18053
Kingdom: Bacteria
Replicon accession: NC_013037
Strand:
Start bp: 4616769
End bp: 4618190
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 51%
IMG OID: 644931625
Product: Anthranilate synthase
Protein accession: YP_003088159
Protein GI: 255037538
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID:
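
The coordinates and lengths listed above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4616769, 4618190)
print(length)                  # 1422
print(protein_length(length))  # 473
```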


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACCC TGGACAAGAC TTATCAGATC ACCACACGCC ACAAAAAGCT CCTGGCGGAT 
ACATTGACGC CCGTTTCCAT TTACCTGAAA CTGCGCGACC GTTTCGTCAA TACCATCCTG
CTCGAAAGCT CGGATTACCA CGGCAACGAA AACTCATTTA CATACATCTG CTGCGACCCG
ATCGCTTCTT TCAAACTGAA TAACAGCACG GTCACGCAGC AGTTTCCGGA CGGCGACATC
GCTACTTTTG AGCTTGCCAA CCGCAAAGAC GCCGTGCAGG CATTGTACGG CTTCGCGCAG
AGCTTTAAGT CCGAAAAAAG TAAGTTTCCC TTCATCACCA ACGGCCTTTT CGGGCATATG
ACTTATGACG CGGTGACGTA TTTCGAGGAT ATCGAAATTC AGCCTACGAA TGCCGAAACG
GAAATCGACC AGATATTCTA CCAGGTTTAC AGATACGTTA TCGCCATTAA CCACTTCAAA
AACGAGCTTT ACATTTTTGA ACATCAGTAT GGCGAGACCA CGGAAGAAGG CGGCATCGAC
CAGATCGAGG TGTTGATCAA GAACCGGAAC TTCCCGCAGT ATGGTTTCAG CATTACCGCC
GATGAGACTT CCAACGTGAC CGACGACGAA ATGCGGAAAG TGATCCAGAA AGGCATCGAT
CACTGCCTCC GTGGGGATGT ATTCCAAATC GTGCCTTCGC GCAGGTTCAG CCGCCAATTT
CAGGGCGACG AATTCAATGT TTACCGTGCA TTGCGTTCCA TCAATCCTTC GCCTTACCTC
TTCTACTTCG ATTACGGCAA TTACAAGATC TTCGGATCGT CTCCCGAAAA ACAGATTTTT
ATCAAGAACG GCCAGGCCGA AATTCACCCC ATTGCCGGCA CATTCCGCCG CACGGGCGAC
GACCAGGCTG ATGCAGAAGC CGCACAAGCC CTGCTGGACG ACCCCAAAGA GACCGCTGAG
CACGTGATGT TGGTCGATCT GGCACGGAAT GACCTGAGCC GCAGCTGCGA CGCCGTAAAA
GTCACAAATT ACAAGGAAGT TCAATACTAC TCGCACGTGA TCCACCTCGT GTCGAAGGTG
GTGGGGAAAA TGAACAAGGA CGTCAATCCA TTGCAACTGG TAGCGGACAC TTTTCCTGCG
GGCACATTAT CCGGCGCACC GAAGCATAAT GCGATGAAGC TCATCGACCA GATGGAGAAC
TGCAACCGCA GCATTTATGG CGGAGCCATC GGTTTTATGG ACTTCAATGG GGATTTCAAC
CACGCCATTG CCATCCGTAC GTTCCTCAGC AAGGATAACA CGCTTTTCTT CCGCGCCGGA
ATGGGCGTGG TCGCCAAATC GAATGTCGAA AGCGAATTGC AGGAGATTAA CAGCAAGCTG
GCCGCATTGC GCCAGGCCAT TCAAGTTGCA GAAGGACTTT AA
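
The sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before computing anything from it. The following is a minimal sketch of how the length and GC content could be recomputed from such a listing; the helper names are hypothetical, and only the first row of the record is used as input here.

```python
def clean_sequence(text: str) -> str:
    """Drop all whitespace (block separators, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGAAAACCC TGGACAAGAC TTATCAGATC ACCACACGCC ACAAAAAGCT CCTGGCGGAT"
row = clean_sequence(first_row)
print(len(row))  # 60
print(round(gc_percent(row), 1))
```

Applying the same two functions to the complete 1422 bp listing would allow a comparison with the GC content reported in the gene information.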
 
Protein sequence
MKTLDKTYQI TTRHKKLLAD TLTPVSIYLK LRDRFVNTIL LESSDYHGNE NSFTYICCDP 
IASFKLNNST VTQQFPDGDI ATFELANRKD AVQALYGFAQ SFKSEKSKFP FITNGLFGHM
TYDAVTYFED IEIQPTNAET EIDQIFYQVY RYVIAINHFK NELYIFEHQY GETTEEGGID
QIEVLIKNRN FPQYGFSITA DETSNVTDDE MRKVIQKGID HCLRGDVFQI VPSRRFSRQF
QGDEFNVYRA LRSINPSPYL FYFDYGNYKI FGSSPEKQIF IKNGQAEIHP IAGTFRRTGD
DQADAEAAQA LLDDPKETAE HVMLVDLARN DLSRSCDAVK VTNYKEVQYY SHVIHLVSKV
VGKMNKDVNP LQLVADTFPA GTLSGAPKHN AMKLIDQMEN CNRSIYGGAI GFMDFNGDFN
HAIAIRTFLS KDNTLFFRAG MGVVAKSNVE SELQEINSKL AALRQAIQVA EGL
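
The protein sequence corresponds to the gene sequence translated with table 11. As a minimal sketch, the translation can be reproduced without external libraries. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons); since this gene begins with ATG, no special start-codon handling is included. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and last four codons of the gene sequence.
print(translate("ATGAAAACCC TGGACAAGAC TTATCAGATC"))  # MKTLDKTYQI
print(translate("GAAGGACTTT AA"))                     # EGL
```

The two outputs match the first ten and last three residues of the protein sequence listed above.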