Gene BBta_4043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_4043 
Symbol 
ID5152501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp4241958 
End bp4243115 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content60% 
IMG OID640558876 
Producthypothetical protein 
Protein accessionYP_001240015 
Protein GI148255430 
COG category[S] Function unknown 
COG ID[COG4645] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.0814624 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCA ATGCGACGCT GCCCGAGAGA GGGCGCGACC TGCGGCTCGA CCTGTTCCGC 
GGCATCGCCA ATTGGGCGAT ATTTCTCGAT CATATCCCCG ACAATGTGGT GAACTGGATC
ACGACCCGGA ATTACGGCTT CAGCGACGCC GCTGACCTGT TCGTGTTCAT CTCCGGCTAT
ACCGCGTCCT TCGTCTATGC CCGCATGATG ATCGACCGCG GCTTCATCGT CGGCGCCACC
CGCCTGTTCA AGCGGGTGTG GCAGCTCTAC GTCGCTCACA TCGTGCTGTT CGTCATATAT
ATTGTGGCCA TCAGCTATCT GGCGACACGC TTCGGCGTCT CCGAGATCAT CGACGAGTTC
AACGTTGCCG GACTGGTCGA CCATGCCAGC GATACGCTGG CGCAGGGGCT CATCCTGAAG
TTCAAGCCGG TCAATCTCGA CGTGTTGCCG CTCTATATCG TGCTGATGGG TTTCTTTCCG
CCGGTGCTGT GGCTCATGCT GCGGCAGCCG GATATCACGA TGATCGCCTC GATCGTGCTT
TGGCTCCTCG CGCGCCAGAT GGGGTGGAAT TTCGCCGCCT ATCCGGCCGG CACTTGGTAT
TTCAATCCGT ATTGCTGGCA GGTGCTGTTC GTGTTCGGCT CGTGGTGCGC GCTCGGCGGC
GCGCGCCGCT CGATGGGCAT CATCATGGCC CCGGCGACAC TCTATTTCTG TCTGGGCTAC
CTGCTGCTCG CATTGATCAT GACCATGGCC GGCCGCTTTC CGGACTATGG CACGATGTTG
CCGCACTGGC TCTATTCGGC GTTCAATCCG AACGACAAGA CCAATCTCGC GCCCTACCGT
TTCCTGCATT TCGTGGTGAT CGTCATCCTG GTGATCCGCT TCGTGCCGAA GGAATGGCCG
GGCCTGGAAT GGAAGGGCTT CGATCCGCTG GTGGTGTGCG GTCAGCAATC GCTCGCGGTA
TTCTGCGTCG GCGTCTTCCT GTCCTTCATC GGCCATTTCA CGCTGATGCT GAGCTCGGGC
TCGCTGCTGG CGCAGATCCT GGTGAGCGCC GCAGGGATCG CGATCATGAC GACGGTGGCC
TATTACATCT CGTGGTCGAA GCGCCAGGAC AAGCCGCTGC CGAAGCCAGC CACACCCAAG
ACCGCCGCGG CGAAGTGA
 
Protein sequence
MKINATLPER GRDLRLDLFR GIANWAIFLD HIPDNVVNWI TTRNYGFSDA ADLFVFISGY 
TASFVYARMM IDRGFIVGAT RLFKRVWQLY VAHIVLFVIY IVAISYLATR FGVSEIIDEF
NVAGLVDHAS DTLAQGLILK FKPVNLDVLP LYIVLMGFFP PVLWLMLRQP DITMIASIVL
WLLARQMGWN FAAYPAGTWY FNPYCWQVLF VFGSWCALGG ARRSMGIIMA PATLYFCLGY
LLLALIMTMA GRFPDYGTML PHWLYSAFNP NDKTNLAPYR FLHFVVIVIL VIRFVPKEWP
GLEWKGFDPL VVCGQQSLAV FCVGVFLSFI GHFTLMLSSG SLLAQILVSA AGIAIMTTVA
YYISWSKRQD KPLPKPATPK TAAAK