Gene Rleg_5140 details


Gene Information

Locus tag: Rleg_5140
Symbol:
ID: 8007000
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 540988
End bp: 541926
Gene Length: 939 bp
Protein Length: 312 aa
Translation table: 11
GC content: 65%
IMG OID: 644822053
Product: protein of unknown function DUF58
Protein accession: YP_002973313
Protein GI: 241113478
COG category: [R] General function prediction only
COG ID: [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain)
TIGRFAM ID:
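
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information fields above.
START_BP = 540988
END_BP = 541926
GENE_LENGTH_BP = 939
PROTEIN_LENGTH_AA = 312


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes a complete, unspliced CDS whose final codon is a stop.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the listed gene length (939 bp) matches the coordinate span, and the listed protein length (312 aa) matches 313 codons minus the stop codon.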


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGATG CGGGCGTCTA TGTTTCGACG GACGAACTGG TCGCGCTCGA AGCGAGAGCC 
CGAGATCTGA GCTTCGTCCA GAAGGCGCGC AGCCATCAGC AGCTTGCAGG CCGCATGCAA
TCGGCGATGC GCGGCCGGGG ACTGATCTTC GAAGAACTGC GCGACTATCT GCCCGGCGAC
GACATCCGCT CCATCGACTG GCGCGTCACC GCGCGAACCA GCAGACCGGT GGTCCGCATC
TATTCCGAGG AAAAGGAGCG GCCCGCGCTG ATCATCGTCG ACCAACGGAT CAACATGTTC
TTCGGCAGCA GGCGATCGAT GAAATCGGTC ACGGCAGCGG AAGCCGCGAT GCTCTGCGCC
TGGCGCATAC TGGGTTCCGG CGACCGGGTC GGCGGCTTCG TCTTCGGCGA AAGCGCAACG
AGCGAGGCAA AACCGCATCG CAGCCGTAAT GCGGTGATTG CCTTTGCGGA ACAAATCGCA
CGGCAGAACG CGAGCTTGCG CGCAGACAGC AAAAGCGAGC CTGACCCGCA GGCGTTGGAC
ACGGTTTTGT CGGCGGTCGC AAATATCGCC CACCACGACC ATCTCGTGGT CGTGGTCTCC
GACTTCGACG GCCATACCGC GACGACGCAA GACATCCTGC TGAGGCTCTC GAGCCGCAAC
GACGTGATCT GCCTGTTGAT CTACGACCCC TTTCTACTGG ACCTGCCGAC CTCGGGCGAC
ATCGTCGTCA GCGGCGGCGG CCCGCAGGCC GAGCTGGCTC TGCGGACACC AAGCGTCCGA
TCGTCGATCG ACGCGTTCGC CCGCAACCGC GGCCGCGAGC TGAGAGCGTG GCAGCGCCGG
CTCGGGCTTC CGATACTGCC CATATCGGCC GCCGAGGAAA CCGCGCCGCA GCTCAGGCGT
CTGCTGGAGC AGTCTGCGTG GCGGCAACGG AGGCGTTGA
 
Protein sequence
MSDAGVYVST DELVALEARA RDLSFVQKAR SHQQLAGRMQ SAMRGRGLIF EELRDYLPGD 
DIRSIDWRVT ARTSRPVVRI YSEEKERPAL IIVDQRINMF FGSRRSMKSV TAAEAAMLCA
WRILGSGDRV GGFVFGESAT SEAKPHRSRN AVIAFAEQIA RQNASLRADS KSEPDPQALD
TVLSAVANIA HHDHLVVVVS DFDGHTATTQ DILLRLSSRN DVICLLIYDP FLLDLPTSGD
IVVSGGGPQA ELALRTPSVR SSIDAFARNR GRELRAWQRR LGLPILPISA AEETAPQLRR
LLEQSAWRQR RR
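
The sequences above are printed in blocks of ten characters separated by spaces and line breaks, so they need to be joined before any computation such as GC content. The sketch below is an illustration under stated assumptions, not the method the database used: the helper names are hypothetical, and GC content is taken here as the plain fraction of G and C bases in the joined sequence.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = join_blocks(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence from this record, as printed above.
first_line = "ATGAGCGATG CGGGCGTCTA TGTTTCGACG GACGAACTGG TCGCGCTCGA AGCGAGAGCC"
print(len(join_blocks(first_line)))  # number of bases on that line
print(round(gc_percent(first_line), 1))  # GC percentage of that line only
```

Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field; the result for a single line is not expected to match the whole-gene figure.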