Gene Ajs_0035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAjs_0035 
Symbol 
ID4672168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax sp. JS42 
KingdomBacteria 
Replicon accessionNC_008782 
Strand
Start bp36284 
End bp37504 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content72% 
IMG OID639837168 
Productsulfite dehydrogenase (cytochrome) subunit SorA apoprotein 
Protein accessionYP_984367 
Protein GI121592471 
COG category[R] General function prediction only 
COG ID[COG2041] Sulfite oxidase and related enzymes 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.7902 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.133868 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACACCC AAGCTCCCGC CTCCCTGCCC CGCCGCCGCC TGCTGGCGGG CAGCGCCAGC 
GCGCTGGCCG CCGCCGGCCT GGCCAGCTTC CACCAGGGCG CCGCGGCGCA GTCGGCCGCG
CCGGCCGCGG CCAAGCCGCT GCCCGGCTAC GCCGGCTGGA AGAACGCCGA TGCCGTCATC
GTGCACAGCA GCACCACCAT CGAGACGCGC CGCGGCGCCT TTGGCACCAG CGTCATCACG
CCCTCGGACC AGCTGTACGT GCGCAACAAC CTGCCCACGC CGCCCGAGTC CATCGTCGCT
GACCGCGACG CCTGGCAGGT GCAGGTGAGC GGGGTAAAGG AGCCGCGCCG CCTGTCGGTG
CGCGAGCTCA AGGCCATGGG GCTGGAGACG GTGACCATGG TGCTGCAGTG CTCGGGCAAC
GGCCGGGGCT TCTTCCCCAG CAAGCCCAGC GGCACGCCCT GGACGGTGGG CGCCGCCGGC
TGCGTGGTCT GGAGCGGCGT GCCCGTGCGC GACGTGGCGC GCGCCCTGGG TGGCGTGGCT
GACGGCATGA AGTACATGAC CGGCACCGGC GGCGAGGTGC TGCCCGCCGG CATCGACCCC
AAGACGGTGA TCGTCGAGCG TTCGGTGCCG CTGGAGGCCA TGCAGGATGC GCTGCTGGCC
TGGGAGATGA ACGGTGAGCC CATACCGCTG GCGCACGGCG GGCCGCTGCG CCTGATCGTG
CCGGGCTACA CCGGCGTGAA CAACATCAAG TACATCGGCC AGCTCGCCTT CACCGACAAG
GAGAGCGAGG CGCGCATCAT GAGCCACGGC TACCGCATCT CGCCGCCGGG CGGCAAGGGC
GACCCCAGCC AGCCGTCGGT GCAGCAGATG AGCGTCAAGT CCTGGATCAA CGGCCCGCTG
CCCGAAGATG GCGAGCTGGC CCCGGGCCGC GTGCAGATCC ACGGCGTGGC CTTCGGCGGC
ATGCACGCCG TCAAGGGTGT GGAGGTGTCC GTCGATGGCG GCAAGACCTG GCAGGCCGCG
CGCCTGGTGG GCCCGGACAT GGGCCGCTAC GCCTGGCGCC AGTTCGTGCT GCAGGCCGAC
CTGCCGCGCG GCAGCCACAC CCTGGCCAGC CGAGCCACCG ATGCTCAGGG CAACGTGCAG
CCCGAGCAGC GCGAGGAGAA CCAAGCCGGC TACAACAACA GCAGCTGGGC GGACCACGCG
GTGACCGTCA AGGTGGCCTG A
 
Protein sequence
MHTQAPASLP RRRLLAGSAS ALAAAGLASF HQGAAAQSAA PAAAKPLPGY AGWKNADAVI 
VHSSTTIETR RGAFGTSVIT PSDQLYVRNN LPTPPESIVA DRDAWQVQVS GVKEPRRLSV
RELKAMGLET VTMVLQCSGN GRGFFPSKPS GTPWTVGAAG CVVWSGVPVR DVARALGGVA
DGMKYMTGTG GEVLPAGIDP KTVIVERSVP LEAMQDALLA WEMNGEPIPL AHGGPLRLIV
PGYTGVNNIK YIGQLAFTDK ESEARIMSHG YRISPPGGKG DPSQPSVQQM SVKSWINGPL
PEDGELAPGR VQIHGVAFGG MHAVKGVEVS VDGGKTWQAA RLVGPDMGRY AWRQFVLQAD
LPRGSHTLAS RATDAQGNVQ PEQREENQAG YNNSSWADHA VTVKVA