Gene Rleg2_3031 details

Gene Information

Locus tag: Rleg2_3031
Symbol: ftrA
ID: 6981776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 3090066
End bp: 3091067
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 65%
IMG OID: 643397741
Product: transcriptional activator FtrA
Protein accession: YP_002282524
Protein GI: 209550607
COG category: [K] Transcription
COG ID: [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain
TIGRFAM ID:
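
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are inclusive and 1-based, and that the gene length counts the stop codon. Both are assumptions, although they agree with the lengths reported in this record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3090066
END_BP = 3091067


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: one residue per codon, minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(START_BP, END_BP)
    print(length, protein_length_aa(length))  # prints: 1002 333
```

The result matches the reported Gene Length (1002 bp) and Protein Length (333 aa).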


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.00937108
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACA GCGTAAAGAT CATGCCAAAC TTATCACCAC AGCAGACGAC AGGACCGTTG 
GTCACAGCCC TTGCCTATGA CGGGCTCTGC ACCTTCGAAT TCGGCATCGC CTACGAGGTC
TTCGGCCTGC CGCGCCCGGA GATGGGCGAA GGCTGGTATC GCTTCTCGGT CTGCGGCATC
GAGCCGGGAC CGCTTCACGC CGCCGGCGGG TTGACGGTCG CGGTCGACAA GGGACTGGAG
ATCCTTGATG AGGCGGATCT GATCGTCGTG CCCGGCTGGC GGGCGATCGA CGCACCGGTG
CCCGAGCCGC TCGCCGAGGC GCTCCGGGCA GCGCATCAGC GCGGCGCGCG CATCATGTCG
CTCTGCTCCG GCGTTGCGGT TCTGGCCGGA TCCGGATTGC TTGCCAACCG CAAGGCGACG
ACGCATTGGC GTTATGTCGC CTCGATCGCC GCTCGTTATC CCGATATCGC GCTCGATGCC
GGCGTTCTCT ACATCGATGA GGGCAGCCTG TTGACGGCGG CGGGCAGTGC CGCCGGCATC
GATCTCTGCC TGCATGTGGT GCGCGGCGAT TTCGGCTCGG AGGCCGCAAA CAGCGTCGCC
CGCCGCCTCG TCGTGCCGCC GCACCGCGAA GGAGGGCAGG CGCAGTTCAT CAGCGCCCCG
GTTCCGGAAG AGCGTGAGGG CATCCGTCTC GGCCCATTGA TCGAATGGAT GCGCGAAAGC
CTTTCGCAGG AGCAGCCGAT CAGGCTGCTT GCGAAAAGAG CTGGCATGAG CATGCGCACT
TTCCAGCGCC GCTTCGAAGC GACGACGGGT CTCAGCGTCG GCGAATGGCT GCTGAAGGAG
CGGCTGCGCC ATGCCCGTGA CCTTCTCGAG AAAGAGCTTG CGGTCTCGCT CGACGACATC
GCGGTATCAA GCGGCTTCGG CACGCTGGCG ACGATGCGGC ATCATTTTCG CAGGCGGCTC
GGGACGAGCC CGAGCGCTTA CAGGCGGTCG TTCGGTCTTT GA
 
Protein sequence
MTDSVKIMPN LSPQQTTGPL VTALAYDGLC TFEFGIAYEV FGLPRPEMGE GWYRFSVCGI 
EPGPLHAAGG LTVAVDKGLE ILDEADLIVV PGWRAIDAPV PEPLAEALRA AHQRGARIMS
LCSGVAVLAG SGLLANRKAT THWRYVASIA ARYPDIALDA GVLYIDEGSL LTAAGSAAGI
DLCLHVVRGD FGSEAANSVA RRLVVPPHRE GGQAQFISAP VPEEREGIRL GPLIEWMRES
LSQEQPIRLL AKRAGMSMRT FQRRFEATTG LSVGEWLLKE RLRHARDLLE KELAVSLDDI
AVSSGFGTLA TMRHHFRRRL GTSPSAYRRS FGL
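
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, not part of the original record, of how one might strip that layout and compute a GC percentage to compare against the GC content field. The helper names are hypothetical, and only the first line of the gene sequence is included here as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed (including spacing).
FIRST_LINE = "ATGACCGACA GCGTAAAGAT CATGCCAAAC TTATCACCAC AGCAGACGAC AGGACCGTTG "

if __name__ == "__main__":
    seq = clean_sequence(FIRST_LINE)
    print(len(seq), seq[:3])          # number of bases and the start codon
    print(round(gc_percent(seq), 1))  # GC percentage of this fragment only
```

Passing the full gene sequence to `clean_sequence` should give a string whose length can be checked against the Gene Length field before computing GC content.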