Gene AnaeK_3994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnaeK_3994 
Symbol 
ID6785464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. K 
KingdomBacteria 
Replicon accessionNC_011145 
Strand
Start bp4505387 
End bp4506868 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content77% 
IMG OID642765463 
Productpara-aminobenzoate synthase, subunit I 
Protein accessionYP_002136328 
Protein GI197124377 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.906481 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTCGCTC CGCGCGCGGT GACGTTCCGG GCCGAGGTCG AGGCGCTCGA CCTCGGCGTC 
GAGGGGTTCG AGCGGTGCGT GGACGCGCTG CGGCGCCGGC CCGGGCTCGT CCTCCTCGAC
AGCCGCGTGG TGGACGGGCG CCTGGGCCGG TTCTCGTTCG CCTGCTTCGA GCCGTTCGCG
ACGCTGATCG CCCGCGACGG CCGGGTGGAG CTGCGGCGCT GGACCGGCGA GCGGGACACC
CTCCGCGGCA AGGCGCTCGA CGTCCTCGAG CGGCTCCTCT CGGCGCACCG CCTCGAGGTG
GACGCGGGCG GGCTGCCCGC GCCGTTCGTG GGCGGGGCCG CCGGCTACCT CGGGTACGGG
CTCGCCCGCG AGCTGGAGCG TCTCCCGCGC GCCGCCCGCG ACGCGTCCGG CGCGCCCGAC
GCCGTGCTCG GCCTCTACGA TCGCGTGCTC GTCCTCGATC GCGTGGCGCG CCGCACGCAC
CTCTCGTGCC TCGCGTCCCC CGACCTCCCC GGCCGCGCGC CCTTCGACGA GGTGCGGCGC
GCGGTGCTCG AGGCCGCGCG GCAGGGCACC GCGGCGGTGG ATGCCGCGCC GGAGCCGTCG
CGCGCGGGAG GCGAACCGGT CGAGCCGGTC GAGTCGGACG AGGAGCCGCT GCTGCGCGAC
CTGACGCGCG AGGCGTACCT CGCGTCGGTG CGGCGCATCC AGGACTACGT CGCCGCCGGC
GACGTGTACC AGGTGAACTT CACCGGGCGC TGGTTCGCGC CGGTGCGCGG GCGCGATCCC
TGGGCGCTCC ACCGCCGCCT CATGCGGCTC AACCCGGCGC CGTTCGCCGC CTGGCTCGGG
TTCGACGCGG TGCAGGTGTC GTGCGCCTCG CCGGAGCGCT TCCTGCGCGT GGACGGCGCC
GAGGTGGAGA CCCGGCCCAT CAAGGGGACC GCGCCCCGCG GGCGGACGCC GCAGGACGAC
GCGCGCCTGC GCGCGGCGCT GCTCGCGAGC GCGAAGGACC GCGCCGAGCT GGCGATGATC
GTGGACGTCG CGCGCAACGA CCTCGGGCGC GTGTGCACCC TCGGCTCGGT GCGGGTGGAC
GCGTTCCCGG AGGTCGAGCG CCACCCTTCG GTCCACCACC TCGTCGCCAC GGTGCGCGGC
CGGCTCGCGC CCGGGCGCGG CGCCTGCGAC CTCCTGCGCG CCGCGTTCCC CGCCGCGTCC
ATCACCGGCG CGCCCAAGAT CCGGGCCATG GAGATCGTGG AGGCGCTGGA GCCGGTCGCG
CGGGGCGTGT ACACCGGCAG CATCGGCTAC CTCGGCTTCC AAGGCACCGC GGACCTGAAC
GTCGCCATCC GCACCCTCGT GGTCGCGGGC AGCGCCGTCC ACCTCCACGC CGGCGGCGGC
ATCGTGGCGG ACTCCGTGCC GGAGGCCGAG CACGACGAGG CGGAGCTGAA GGCGCGCAAC
CTCGTCCGCG CCGTGGCGGG CTGGCACGAG GTGGCGCGGT GA
 
Protein sequence
MLAPRAVTFR AEVEALDLGV EGFERCVDAL RRRPGLVLLD SRVVDGRLGR FSFACFEPFA 
TLIARDGRVE LRRWTGERDT LRGKALDVLE RLLSAHRLEV DAGGLPAPFV GGAAGYLGYG
LARELERLPR AARDASGAPD AVLGLYDRVL VLDRVARRTH LSCLASPDLP GRAPFDEVRR
AVLEAARQGT AAVDAAPEPS RAGGEPVEPV ESDEEPLLRD LTREAYLASV RRIQDYVAAG
DVYQVNFTGR WFAPVRGRDP WALHRRLMRL NPAPFAAWLG FDAVQVSCAS PERFLRVDGA
EVETRPIKGT APRGRTPQDD ARLRAALLAS AKDRAELAMI VDVARNDLGR VCTLGSVRVD
AFPEVERHPS VHHLVATVRG RLAPGRGACD LLRAAFPAAS ITGAPKIRAM EIVEALEPVA
RGVYTGSIGY LGFQGTADLN VAIRTLVVAG SAVHLHAGGG IVADSVPEAE HDEAELKARN
LVRAVAGWHE VAR