Gene BBta_4022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_4022 
Symbol 
ID5152472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp4226022 
End bp4227197 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content66% 
IMG OID640558853 
Productputative cysteine synthase 
Protein accessionYP_001239994 
Protein GI148255409 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0031] Cysteine synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.504798 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGGCAGG GATCGTGGTC GGTTGCCGGG ATGATCTCGC CTGCCTCCCG TGCTTCGCAA 
GTCTCGCTGC CTCGCTACCG CCGCGTCTGG GTCGACGATG CCGTGGCCGC GATCGAGGCT
GATCAGTGCC GGACAGCTGA TACGCATCTG ATCCGCCTCA TCGTGCCGGC GCTGGCGGGC
ATCGACATTT ACCTGAAGGA CGAATCGACG CATCCGACCG GCAGCCTGAA GCATCGGCTC
GCCCGCTCGC TATTTCTCTA TGCGCTCTGC AATGGTCATA TTCGCGAAGG CACGCCGGTG
GTCGAGGCGT CCTCGGGGTC GACCGCGGTG TCGGAGGCCT ATTTCGCGCA GATGATCGGC
GTGCCGTTCT ATGCAGTGAT GCCGCGCACC ACCTCGCCTG AGAAGATCGC GGCGATCACC
CATTATGGCG GCAATTGCCA CCTGATCGAT GATGGCCGGG CGCTCTATGC CGAGGCGGCT
GCGCTCGCGG CGCGGCTCGG TGGTCATTAC ATGGATCAGT TCACCTTCGC CGAGCGCGCC
ACCGATTGGC GCGGCAACAA CAATATCGCC GAATCGATCT TCAACCAGCT GCAGGGCGAG
CGCTGTCCAC TGCCGGAATG GATCGTGATG GGCGCCGGCA CCGGCGGCAC GTCGGCGACC
ATGGGCCGTT ATTTGCGCTA TCGCCGTTAT CCCACGCGGC TCTGCGTCGC CGACGTCGAG
CATTCCGCCT TTTTCGATGC CTTCTGCTCG GGCGATGTCC GGCAGACCTG CGAGAGGCCG
TCCCTGATCG AGGGCGTCGG CCGGCCGCGT TGCGAGCCGT CCTTCGTGCC GGGGGTAGTC
GACCGCATGA TCAAGGTGCC GGACGCGGCC TCGATCGGGG CGATGAGCGT GCTGACAAGG
CGGCTGCGCC GGCCGGTCGG CGGCTCGACC GGGACCAACT TCCTGGCGCT ATGCCGGCTT
GCCTCCGAGA TGCGCGAGGC CGGTGTGATC GGATCGGTCG TGACGTTGAT CTGCGACTCT
GGCGAGCGCT ACCGCCAGAC CTATTACGAT CCGCAATGGC TGGCGGCGCG CGGCCTCGAT
CCGGCCCCCT ATGACGCGGC GCTGTCCGCT TTTCTCGACA CCGGCGCGCC GCTCCGCCTC
GCCATTCCCG ACGCCGTCAA TCCGCGAAGT GACTGA
 
Protein sequence
MRQGSWSVAG MISPASRASQ VSLPRYRRVW VDDAVAAIEA DQCRTADTHL IRLIVPALAG 
IDIYLKDEST HPTGSLKHRL ARSLFLYALC NGHIREGTPV VEASSGSTAV SEAYFAQMIG
VPFYAVMPRT TSPEKIAAIT HYGGNCHLID DGRALYAEAA ALAARLGGHY MDQFTFAERA
TDWRGNNNIA ESIFNQLQGE RCPLPEWIVM GAGTGGTSAT MGRYLRYRRY PTRLCVADVE
HSAFFDAFCS GDVRQTCERP SLIEGVGRPR CEPSFVPGVV DRMIKVPDAA SIGAMSVLTR
RLRRPVGGST GTNFLALCRL ASEMREAGVI GSVVTLICDS GERYRQTYYD PQWLAARGLD
PAPYDAALSA FLDTGAPLRL AIPDAVNPRS D