Gene BBta_4972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_4972 
Symbol 
ID5156000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp5209480 
End bp5210592 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content61% 
IMG OID640559758 
Productputative arylsulfatase regulatory protein 
Protein accessionYP_001240887 
Protein GI148256302 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCCGA CGGCGTTTTG CAACGTCGAT TGCGACTATT GCTACCTTCC TAACCGAACC 
GACCCGCGGA TCATGAGCCA CGATATCGTC GCTGCGGCGG CAGATTTCGT GTTCCAGGGA
GGGCTGGACG CGAGCGATTT CACCGTAGTG TGGCATGCCG GCGAACCCTT GGTCGTTCCA
CCGTCGTGGT ATCGTGAAGC GTTTGCGAGG ATTGGAGCGG CGGCACCGGC GAACAAGGCC
GTTCCTCATG CCATCCAGAC GAATGGGATG CTGATCAATG ACGATTGGTG CGACCTGTTT
CTCGCGCACG GCGTTCGAGT CGGTGTCAGC ATCGATGGTC CTGCGTTCTT GCACGATGCA
CGTCGCCGCA CCCGCTCCGG TAAGGGGACA CATGCTGCGG CGCTGCGTGG GCTGCGGAAG
CTCAGGGAGC GCGGCGTGCC AAGCCACGCT ATCTGCGTCG TCACCAATGC AACCTTGCCG
CATGCGCGCG AACTCATTGC TTTCTTCAAT GAAGAAGGCG TCACGGATCT GGGCTTCAAT
ATCGAGGAAG TGGAAGGCGC CAACACGGCG TCAAGCCTGG CACGCCCCGG CTCGATCGAG
GATTTTCGGG CGTTTTTCGA AGGCGTGCTG GAAGCTGCCG ACAGCGCGAG TCCGCCGCTG
CGCATCCGCG AGTACCGAAA CATGCTTGCA ATGCTCAAGC ACCCGGCCTT TGGCCGCTTG
AACGCCAATT CCCAGAACAT GCCCTTCGCC ATGCTGACTG TCGCGACCGG CGGTGAGCTT
TTCACCTTCT CGCCGGAATT GGCCGGCTTG CTGCATCAGG ACTACGGCAA CTATGTTGTG
GGCCGGTTGC CGCAAGCGCG TCTGGGTGAC GTGCTCGCCA ATCCGGTATT TCGCCGCATG
CTCGACGACA TCTGGGAGGG GATCGCGCTG TGCCATCAGA GCTGCCGATA TTTCGACATT
TGCCTCGGTG GGTCGCCCGT CAACAAGATG TCGGAGTGCG GGAGCTTCGT GGCTACGGAA
ACCCTTGCCT GCAAGCTGGT CCATCAGGTC GTCGCCGACG TATCGCTCGC CCATCTGGAT
CGGCGGATGT CGGACGAGCG TATTGGGGCC TGA
 
Protein sequence
MQPTAFCNVD CDYCYLPNRT DPRIMSHDIV AAAADFVFQG GLDASDFTVV WHAGEPLVVP 
PSWYREAFAR IGAAAPANKA VPHAIQTNGM LINDDWCDLF LAHGVRVGVS IDGPAFLHDA
RRRTRSGKGT HAAALRGLRK LRERGVPSHA ICVVTNATLP HARELIAFFN EEGVTDLGFN
IEEVEGANTA SSLARPGSIE DFRAFFEGVL EAADSASPPL RIREYRNMLA MLKHPAFGRL
NANSQNMPFA MLTVATGGEL FTFSPELAGL LHQDYGNYVV GRLPQARLGD VLANPVFRRM
LDDIWEGIAL CHQSCRYFDI CLGGSPVNKM SECGSFVATE TLACKLVHQV VADVSLAHLD
RRMSDERIGA