Gene BBta_5530 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_5530 
Symbol 
ID5152288 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp5745820 
End bp5747370 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content64% 
IMG OID640560274 
Productputative flagellin protein, C-terminus 
Protein accessionYP_001241396 
Protein GI148256811 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.567075 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATGT CAGGTATCGT TCTCTCGGCG TCGGTGCGCC AGAACCTGCT GTCGCTCCAG 
TCGACGGCGC AGCTTCTCGC CACCACCCAG AACAATCTCT CCACGGGCAA GAAGGTCAAC
TCGGCACTCG ACAATCCGAC CAACTTCTTT ACCGCCCAGG GCCTCGACAA CCGCGCTTCC
GACATCTCCA ATCTGCTCGA TGGCATCGGC AATGGCGTGC AGGTTCTGCA GTCTGCCAAC
ACCGGCATCA CCTCGCTGCA GAAGCTCGTC GACAGCGCCA AGTCGATCGC CAACCAGGTG
CTGCAGAGCG CGGTCGGCTA TTCCACGAAG TCGAACGTGA CCTCAGCTGC GCTGACCGGC
GCGACCACCA CCAGCCTGAT TGGCGCCAGC TCGACCGCCG TCACCGGCTC CGTCGTGCTG
AACGACAACA CCTCGACCGC GGTGGCGATC ACCGGCTCGA CCAAGCTGTC GGGTACGCCG
AGCACCTCGT CGAACGACCT GGCCTCCAGC ATCACCACCG GCGACACGCT GGTCGTGAAC
GGCACCACCT TCACCTTTAT CGCCGGCACG TCCTCGTCCG GCACCAATAT CGGCGTCGGT
GACAGCGTTA CGAACCTGCT GTCGACCATC CAGAGCGCGA CCGGCGTCAC CTCGTCGATC
ACGGCTGGCG CGATCACGCT GACGCCGCCG GCGGCAGGGC TGACATTGTC CGGTACGTCG
CTGGCCAAGC TCGGTCTCAG CGCGGTCGGC AATTCGCTGT CCGGGCAGAC GCTGACAATC
GCCGCCACAG GAGGTGGCAC GGCGACCAGT GTCACGTTCG GTCTGGGAAC AGGGCAGGTC
AACTCGTTGA ACGACCTCAA TGCGAAGCTT GCGGCCAACA ACCTGCAGGC ATCGTTCGAC
ACGGCCACCA GCAAGATCAC GATCTCGACC ACGAACGATG CGGCCTCGGC GACGATCGGT
GCGATCGGTG GTACGGCGGC GGCGTCCAGC CAGTCCTTCA ACGGTCTGAC GGCGGCGGCT
CCGGTGGCTG ATGCGACCGC GCAGTCGCAG CGGTCGAGCC TGGTCGCGCA GTACAACAAC
GTGCTGGCGC AGATCAACAC GACCGCAGCT GATGCCTCGT TCAACGGTGT CAACCTGCTC
AACGGCGACA CGCTGAAGCT CACCTTCAAC GAGAACGGCA AGTCCACGCT GTCGATCACC
GGCGTGACCT TCAACACCGG CGGCCTGGGT CTGTCGACCC TGACCGCAGG CACCGACTTC
CTCGACAACA ACTCGGCGAA CAAGGTGATC GGCGTGCTCA ACACGGCGAG CTCCACGCTG
CGCAACGAGG CGTCGACCTT GGGTTCGAAC CTGTCGGTGG TGCAGATCCG TCAGGACTTC
AACAAGAACC TGATCAACGT GCTGCAGACC GGCTCGTCGA ACCTGACCTT GGCCGACACC
AACGAGGAAG CGGCCAACAG CCAGGCGCTG TCGACCCGCC AGTCGATCGC GGTGTCCGCG
CTGTCGCTCG CCAACCAGTC GCAGGCAAGC GTGCTGCAGC TGCTGCGCTG A
 
Protein sequence
MKMSGIVLSA SVRQNLLSLQ STAQLLATTQ NNLSTGKKVN SALDNPTNFF TAQGLDNRAS 
DISNLLDGIG NGVQVLQSAN TGITSLQKLV DSAKSIANQV LQSAVGYSTK SNVTSAALTG
ATTTSLIGAS STAVTGSVVL NDNTSTAVAI TGSTKLSGTP STSSNDLASS ITTGDTLVVN
GTTFTFIAGT SSSGTNIGVG DSVTNLLSTI QSATGVTSSI TAGAITLTPP AAGLTLSGTS
LAKLGLSAVG NSLSGQTLTI AATGGGTATS VTFGLGTGQV NSLNDLNAKL AANNLQASFD
TATSKITIST TNDAASATIG AIGGTAAASS QSFNGLTAAA PVADATAQSQ RSSLVAQYNN
VLAQINTTAA DASFNGVNLL NGDTLKLTFN ENGKSTLSIT GVTFNTGGLG LSTLTAGTDF
LDNNSANKVI GVLNTASSTL RNEASTLGSN LSVVQIRQDF NKNLINVLQT GSSNLTLADT
NEEAANSQAL STRQSIAVSA LSLANQSQAS VLQLLR