Gene Rleg_2814 details

Gene Information

Locus tag: Rleg_2814
Symbol:
ID: 8013754
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 2797157
End bp: 2798155
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 60%
IMG OID: 644825385
Product: flagellin domain protein
Protein accession: YP_002976614
Protein GI: 241205518
COG category: [N] Cell motility
COG ID: [COG1344] Flagellin and related hook-associated proteins
TIGRFAM ID:
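
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that does not contribute to the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2797157
end_bp = 2798155

# Assumption: coordinates are 1-based and inclusive, hence the +1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon,
# which is not counted in the protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, codons, protein_length)  # 999 333 332
```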


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTATTT ACCAGCGCGT CTCGGTGGAT GCGGCGCTTC ATGTGCTGCG CGATATCAAC 
CGTAATATGG CGGTCACGCA AAACCACATC ACGACCGGTA TGCGTGTGGC AAAAGCCAGC
GACAATGCCG TCTATTGGTC GGTCGCCACC ACTGCGCGAA CCGACAACAA GGCGGTTTCG
GCGATCCAGG ATGCGCTTGG TATGGCGGCG GCGACGATGG GAACGGCCTA TACCGGCGTC
CAGAACGTCA TCGATGTCGT CTCTGAGATC AAGGCCAAGC TGGTTGCCGC GACCGAAGAC
GGGGTCGACA AGGACAAGGT CAATGAAGAG ATCAAGCAGT TGCAGGAGCA GTTGCGCAGC
GTCTCCGAGG CGGCGACTTT CAATAGCGAC AACTGGGTGG TTCTCAACAA CGATGCGACA
CCGACGCAGC CGCGCCAGAT TCCGGCCTCC TTCATCCGCA ATGCCGACGG GACCATCTCG
GTCGGCATGC TGAGCTATCA TATCGACACG ACGCCGAGCG GGAGCACGAC CTCTAAGGAC
GCACGCTACC TGATCGATGA TCGCGCCACC GGTTCGGGCG AATACGGCGT GCTGACATCG
GCCTATTTCG CCACCGAGCT CGGCGCGTCG CAGGACTACG TGCTGATGCA GAGCAAAAAC
GGCACCACCA CAGGGCAGGT AGTGATTTCG CTCTCGGCTA GCACGACGAA AGGACAGGTC
GGCGAAATGA TCAGCGTCGT CGATGCCGCG CTGTCGCAGC TGACGACGGT CGGTTCGGCC
TTCGGCGCGT TGGAGAAACG CATCAACCTG CAGAACGACT TCGCCACGAA ACTGCACGAC
AACAATGCCA CCGGCATCGG CCGGCTTGTC GATGCCGACA TGGAGGAGGA GTCGAGCAGG
CTCAGGGCGC TGCAGACGCA GCAGCAACTC GGCCTGCAAT CGCTGAACAT CGCCAACGCA
ACCTACGATA CGGTGCGGCA GTTGTTCCAA AATTTCTAA
 
Protein sequence
MSIYQRVSVD AALHVLRDIN RNMAVTQNHI TTGMRVAKAS DNAVYWSVAT TARTDNKAVS 
AIQDALGMAA ATMGTAYTGV QNVIDVVSEI KAKLVAATED GVDKDKVNEE IKQLQEQLRS
VSEAATFNSD NWVVLNNDAT PTQPRQIPAS FIRNADGTIS VGMLSYHIDT TPSGSTTSKD
ARYLIDDRAT GSGEYGVLTS AYFATELGAS QDYVLMQSKN GTTTGQVVIS LSASTTKGQV
GEMISVVDAA LSQLTTVGSA FGALEKRINL QNDFATKLHD NNATGIGRLV DADMEEESSR
LRALQTQQQL GLQSLNIANA TYDTVRQLFQ NF
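
The relationship between the two sequences can be illustrated with a short translation sketch. It assumes translation table 11 assigns the same amino acids to codons as the standard code, and that an alternative start codon such as GTG is read as methionine at the initiation position. The function name `translate_cds` and the simplification of always reading the first codon as M are choices made for this example, not part of the record.

```python
from itertools import product

# Codon-to-amino-acid string in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if i == 0:
            aa = "M"  # assumption: the first codon is a valid initiator
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_block = "GTGAGTATTT ACCAGCGCGT CTCGGTGGAT"
print(translate_cds(first_block))  # MSIYQRVSVD
```

The output matches the first ten residues of the protein sequence listed above; passing the full gene sequence to the same function should allow the whole translation to be compared in the same way.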