Gene BBta_4141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_4141 
Symbol 
ID5151240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp4348266 
End bp4349393 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content66% 
IMG OID640558971 
Producthypothetical protein 
Protein accessionYP_001240109 
Protein GI148255524 
COG category[S] Function unknown 
COG ID[COG5330] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.0877657 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAAAC CTCATCTCAC GATCGCCGAT GAAGTGCAGG CGGCGATTGC CACCGGCTCA 
GCCGAACGCT GCTCAGTGGT GGCCGAGCGG GTCGCGTCCC TGTTCATCGC CTCGGCAGGC
AACATGGATA TCGAGCAGCA TGCGCTGTTC GCAGACGTTT TCGAGCGCCT CGTCAACACG
ATCGAGCTGC GTGCGCTCGC CGATGTCAGC GCACGGATCG CGCTCGCCGA GCTCAGCGCG
CAGCTTGCGC CGGTGCCTCA GGCGCCCGTC GCGGTGATCC GGCGGCTTGC CCGTCATGAA
GATATTGCCG TTGCCGAGCC GGTGCTCGCG GAGTCACCGC GGTTGAGCAA CGCGGATCTC
ATCGAGATCG CGAACACCCG CAGCGAACAG CATCTGATCG CGATCGCCGG CCGCTGGTGG
CTGCAGGAAG TCGTCACCGA TGCCTTGCTG GGGCGGCGTT TTCCCAGCGT GAGCCGCAAG
CTGATGAAGA ATCCCGGCGC GCGGATCTCC GCGGCTGGCT TTTCCATCAT CCTGTCCCAG
GCGATCAACG ATCCGGAGCT CACGATCGCC ACCGGCATAC GCGCCGATCT GCCGGCCGGG
TTGCGCAGGA CGCTGTTGCA GAGCGCGACC GAGGCGGTCA AAGCCCGCCT TCTCGCGTCG
GCGCCGCCGC ATCTCTATGA GGAAATCCGA AGCGCGATTG CGGCCGCCGC CGCCGGCGCT
GAGCGGGACA TGGCGCGACA ACGCGATTTC GGCAGCGCCA AAGCGGCGTT CGGGCAGCTG
CGGCAGACCG GCAGGCTGAA CGAGACCATG CTGCTCGATT TCGCCAGGCA GCGCCGCTAC
GTGGAGACGA CAGCGGCGAT TGCGGAGCTC GCAAAATGCA GCATCGATCT GGTGCGGCCG
CTGATGCAGA GCCTGCGCAG CGATGGCATT CTCGTTCCCT GCAAGGCGGC GGGACTGAGC
TGGGACACGG TGGTGGCCAT TCTCGACAGC CGCTTCGTCT CGGGCGCGAC GCCGCCTGAC
GAACTCGCCA AGCTCAAGAC CAAATACCGT GCGCTGACCG CCGACGAGGC CCAGCGCACG
CTCAATTTGT GGAATGTCAG GACAGCGGCC CCGGCCAAGT CGATTTGA
 
Protein sequence
MSKPHLTIAD EVQAAIATGS AERCSVVAER VASLFIASAG NMDIEQHALF ADVFERLVNT 
IELRALADVS ARIALAELSA QLAPVPQAPV AVIRRLARHE DIAVAEPVLA ESPRLSNADL
IEIANTRSEQ HLIAIAGRWW LQEVVTDALL GRRFPSVSRK LMKNPGARIS AAGFSIILSQ
AINDPELTIA TGIRADLPAG LRRTLLQSAT EAVKARLLAS APPHLYEEIR SAIAAAAAGA
ERDMARQRDF GSAKAAFGQL RQTGRLNETM LLDFARQRRY VETTAAIAEL AKCSIDLVRP
LMQSLRSDGI LVPCKAAGLS WDTVVAILDS RFVSGATPPD ELAKLKTKYR ALTADEAQRT
LNLWNVRTAA PAKSI