Gene Daud_2033 details


Gene Information

Locus tag: Daud_2033
Symbol:
ID: 6026039
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Desulforudis audaxviator MP104C
Kingdom: Bacteria
Replicon accession: NC_010424
Strand:
Start bp: 2140241
End bp: 2141770
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 68%
IMG OID: 641594854
Product: para-aminobenzoate synthase, subunit I
Protein accession: YP_001718155
Protein GI: 169832173
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade
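
The length fields above are consistent with the coordinates: with inclusive start and end positions, the gene length is End - Start + 1, and for an unspliced CDS the protein length is one third of that minus the stop codon. A minimal Python sketch of this arithmetic follows. It assumes inclusive coordinates (which matches the values in this record), and the function names are illustrative, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2140241, 2141770)
print(length, protein_length(length))  # 1530 509
```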


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0100094
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTGTCG ACAAGCGGCC CTGGCCGAGC GGCCCGGAGG CGCTTTACAC CCGCCTGGAA 
AACCGGCCGG GACTGGTTAT CCTGGACAGC GGGATGGAAC CGCGGGCGGA CGCCCTGCAC
GCCTCGGGCC GGTGGTGTTT CGCCGCTTTT GATCCTTTCG CCGTTTTGGA GTGCCGGTTG
GATGGTTGCC GCCTCACTGT TGGTGGCCGC GCCCGGGAGT TCAAGGGGGA CCCCCTGGAC
ATCCTCCAGA GCATCCTGGA GGAATACGCC CTGCCGCCCG GTTCGGGACC CACGCCCCTG
CCTGCGGGCG GCATCGGGTT CTTGGCGTAC GGGCTGCGGG TTTTCCTGGA GCCGCGGCCA
ACTCGGCCCG ACGACCTGGA TCTGCCCGTG CTCCACCTTG GGTTCTACGA TGCCGTACTG
GCTTTCGACC GCCGGGATGG GTCTCTCTAC CTCACATCCA CCGGGCTGCC GGCCGGCGGG
CGCGCCCGGG CCGAACGGGC GGCGTCCAGG ATGCGTTTGC TGCGGGAAGT GGTTGATTCG
GCGGTTGAAA AACCGGCCTC CGCCCCGCCG GAGCCCGGGG TACCCGGGAG TACCCGCGGG
TACCCGGGGG TACCCGGGGG CACCCGCGTT AGCGGGTCAG CGGGTCAGCG GGTCAGCGGG
TCAGCGGGTC AGCCGGTGCC GCCCGCGGCG GTGAGTTCGA GCTTTGACCG GGCTGCTTAC
CTGGAGGCGG TGCGCCGTGT AAAGAACCAC ATTCTGGCCG GGGACGTTTA CCAGGTAAAC
TTGGCCCAAC GGTTTTCGGT TCCCTGGACG GGGTCGGCCC ACGCCCTGTT CGGCAGGCTC
TGCCGTGACA ACCCGGCCCC TTTTTCGGCG CTGATTAAGG GAGCCGGCTT CGCTGTGGTC
AGCGCCTCTC CGGAGCGTTT TCTGCACTTG AACCCCCGGA CCGGTGTGGT GCACACCCGA
CCCATCAAGG GCACCCGCCC GCGCGGCTCT TCACCGGAGA CCGACGCGCG CCTGGCCTGC
GAACTGCTGG CCAGCGAGAA GGACCGGGCC GAGCACATAA TGATCGTTGA CCTGGAGCGC
AACGACCTGA GCCGCGTCGC GCAGCCGTCA TCGGTCCGGG TTCCGGAAAT GCTGGTTCTG
GAGCCTTTCC CGACGGTCTG GCACCTGGTG TCGACGGTGG AGGCCGAACT GCGGCCGGGG
ACCGGCGTCG CCGACCTCTT ACGCGCCGCC TTTCCGGGCG GTTCGATCAC GGGCGCGCCC
AAGATCCGGG CCATGGAGAT CATCGAGGAA CTGGAACCGG TACCCCGGGG CGTGTATACC
GGTGCCGCCG GTTATTTCAG CTTCGACGGC CATCTGGACC TGAACATCAC CATCCGGACC
ATCGTGCTCC GCGGCGGACG GGCGTGGTTC CACGTCGGCG GCGGGATCGT GGCCGATTCG
GAGCCGGAGG CCGAATACCG GGAGACACTA GATAAAGCGC GAGCGCTGTT TGCGGCGCTG
GGGAGCCCGG GGCGCCGTGA AGGAGGATGA
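
The gene sequence is displayed in blocks of 10 bases, so the whitespace has to be removed before the sequence can be measured. The sketch below shows one way to do that and to compute a GC fraction. The helper names `unblock` and `gc_fraction` are hypothetical, and only the first displayed row is used as input, so its result is for that row alone and is not expected to equal the GC content field of the whole gene.

```python
def unblock(seq_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed row of the gene sequence (six blocks of 10 bases).
first_row = "ATGCTTGTCG ACAAGCGGCC CTGGCCGAGC GGCCCGGAGG CGCTTTACAC CCGCCTGGAA"
seq = unblock(first_row)
print(len(seq), f"{gc_fraction(seq):.0%}")
```

Passing the complete gene sequence block to the same two functions gives a length and a GC percentage that can be compared against the Gene Length and GC content fields.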
 
Protein sequence
MLVDKRPWPS GPEALYTRLE NRPGLVILDS GMEPRADALH ASGRWCFAAF DPFAVLECRL 
DGCRLTVGGR AREFKGDPLD ILQSILEEYA LPPGSGPTPL PAGGIGFLAY GLRVFLEPRP
TRPDDLDLPV LHLGFYDAVL AFDRRDGSLY LTSTGLPAGG RARAERAASR MRLLREVVDS
AVEKPASAPP EPGVPGSTRG YPGVPGGTRV SGSAGQRVSG SAGQPVPPAA VSSSFDRAAY
LEAVRRVKNH ILAGDVYQVN LAQRFSVPWT GSAHALFGRL CRDNPAPFSA LIKGAGFAVV
SASPERFLHL NPRTGVVHTR PIKGTRPRGS SPETDARLAC ELLASEKDRA EHIMIVDLER
NDLSRVAQPS SVRVPEMLVL EPFPTVWHLV STVEAELRPG TGVADLLRAA FPGGSITGAP
KIRAMEIIEE LEPVPRGVYT GAAGYFSFDG HLDLNITIRT IVLRGGRAWF HVGGGIVADS
EPEAEYRETL DKARALFAAL GSPGRREGG
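
The protein sequence can be checked against the gene sequence by translating it. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons are permitted, so the sketch below builds the codon table from the compact NCBI amino-acid string (bases ordered T, C, A, G). This is an illustrative helper, not the database's own method. It assumes an ATG start, as in this gene, and does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in NCBI order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate a blocked DNA listing, stopping at the first stop codon."""
    seq = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGCTTGTCG ACAAGCGGCC CTGGCCGAGC"))  # MLVDKRPWPS
```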