Gene Ajs_4049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAjs_4049 
Symbol 
ID4673054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax sp. JS42 
KingdomBacteria 
Replicon accessionNC_008782 
Strand
Start bp4315147 
End bp4316307 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content72% 
IMG OID639841090 
ProductFis family transcriptional regulator 
Protein accessionYP_988229 
Protein GI121596333 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACGCG ACGAACTGGC CGCCTGGCTG CGACTGACGC TCACGCCTGG CGTGGGCAAC 
GGTGCCGCGC GCAGGCTGCT GGCCGCGTTC GGCCTTCCCC AGAACATCTT TGCGCAGGGC
GAGAGGGCCT GGCAGTCCTG CGTCACCACG GCGCAGGCGC GCGCGCTCGC CCGCGTGCCC
GAGGAGCTGG AGGCGCTGAC CGAGCGGACG TGGCAGTGGC TGCTGGACGG GCGGGCGCCC
GCGCAGCCGG CGCACGGCAT CGTCACGCTG GGTGACGAGG CCTATCCTCC TGCGTTGCTG
GCCACCGAGG ATCCGCCGCT GCTGCTGTAC CTGCTGGGTG CTCCGCAGTT CGTGCAGGGC
GGCGCGCCGT TCGCGCCGGC GCACAGCCTG GCCATGGTGG GTAGCCGCAA CCCCACGGCC
CAGGGGGCGG ACCATGCCCG GCAGTTCGCA CGTGCTCTGC GGGAGGCGGG TTTGTGCGTC
GTTTCGGGCC TGGCGCTGGG CATCGACGCG GCGGCGCACG AGGGCGCGCT GATCGATCCG
CCCGACGGCG CCGCGCCCGC CACCATTGCC ATCGTGGGCA CCGGGCTGGA CCGGGTGTAC
CCGCGTGCCA ACAAGGAGCT GGCCCACCGC ATCGCGCGCC ACGGCTTGCT GGTCAGCGAA
TACCCGCTGG GCACGCCGCC GTTGCCGGCC AACTTTCCCA AGCGCAACCG CATCATCTCC
GGCCTGTCGC AGGGCACTCT GGTGGTGGAG GCGGCGCTGG CCTCGGGCTC GCTGATCACC
GCGCGGCTCG CGGCCGAGCA GGGGCGCGAG GTGTTCGCCA TACCCGGCTC CATCCACGCG
CCCCAGTCGC GTGGCTGCCA CGCGCTGATA CGCCAGGGCG CCAAGCTGGT GGAGTCGGCG
CAGGATGTGC TGGAAGAACT GCGCTGGCAT GCGCCGGCAG CCGCCATCCC CGCAGCACAG
GACGCGTCCG AGGAGCCCCT TGCGCCGTCT TATCAGTGTG TGCTGGATGC GCTGGGGTTC
GATCCACTGG GGCTGGACGC ACTGGTGGCG CGCACCGGGC TGGATGCCGC CACGCTGCAA
GTACGGCTGC TGGAACTGGA ACTGGAGGGG CGTGTCGCGC GCCTGCCCGG CGGGCTGTTC
CAGCGCGTGG GGCAGGCCTA A
 
Protein sequence
MQRDELAAWL RLTLTPGVGN GAARRLLAAF GLPQNIFAQG ERAWQSCVTT AQARALARVP 
EELEALTERT WQWLLDGRAP AQPAHGIVTL GDEAYPPALL ATEDPPLLLY LLGAPQFVQG
GAPFAPAHSL AMVGSRNPTA QGADHARQFA RALREAGLCV VSGLALGIDA AAHEGALIDP
PDGAAPATIA IVGTGLDRVY PRANKELAHR IARHGLLVSE YPLGTPPLPA NFPKRNRIIS
GLSQGTLVVE AALASGSLIT ARLAAEQGRE VFAIPGSIHA PQSRGCHALI RQGAKLVESA
QDVLEELRWH APAAAIPAAQ DASEEPLAPS YQCVLDALGF DPLGLDALVA RTGLDAATLQ
VRLLELELEG RVARLPGGLF QRVGQA