Gene Rleg_3936 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3936 
Symbol 
ID8014752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4009875 
End bp4010885 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content60% 
IMG OID644826505 
ProductNMT1/THI5 like domain protein 
Protein accessionYP_002977716 
Protein GI241206620 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.541401 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCTTC TCACCCGTCG CCAGACGATC TTCGCCGCCA TCGCGGCAAG CGTCGCCGGC 
CGCACTGCCT TTGCCCAATC GGCGCCCGCA AAGGTTCGCA TCGCGCTCGA CTGGACGCCC
AACACCAACC ATATCGGCAT CTATGTCGCC AAGGCGAAGG GCTTCTATGC CGATGCCGGG
CTCGATGTCG AGATTCTTCC CTTCACCGAT ACCAGCGCCG GAACGCTGGT GTCGAACGGC
GTTGCCGATT TCGGCATCAG CAGCGAGATC GAGACGCTGA CGCAACGCGC TGGCGGCGGT
GACGTGAAGA TGGTCTACGG CGTCGTCCAG ACGGAAACCG CACGCCTGAT CTTCAAGGGC
GGACGCGACG ACATCAAGAG CCCGAAAGAC CTCGACGGCA AGACCTATGG CGGCTTTGGT
GGCACCTGGG AGAGCGCGCT GATCTCGGCG ATGATCCGCA ATGACGGCGG CAAAGGCGAC
GTCAAGACCG TCACCCTCGG CACCTCCGCT TACGAGGCGC TGGACAATGG CTCGATCGAT
TTCACGCTGG AGATCTACAC CTGGGAAGGC ATCGCTGCCG AACTGGAAAA CCGCAAGATC
GGCCGCTTCC ACTATTCCGA TTATGGCATT CCCGACGAGC AGACGACGGT CATCGTCTCC
AGCGACGCCT ATCTCTCCGC AAGTCGGGAC CACGCCCGCG CCTTCATCCA GGCGACACGA
AAGGGTTATG CCTACTCCGT CGACCATCCC GACGAAGCCT GCGACCTGTT GATCTCTGGA
AGCAACGGCG CACTGATGAA TACGGAACTG GTAAAGGCTT CTCAGAAGGC ATTGATCGAG
GGCCACTTCC TGAAATCCGA GGCCGGTGTG ATCGGTAAGC TCGACCCGGC AAAGGCCGAG
GCCCTGGGTG GCTTCCTGAT CGAGAATGGT ATTCTGGTCG ATGCGAATGG CGCCGCACTC
AAGGAGAAGC CGGACTTTTC CACCTATTAT ACCAACGAAC TTCTCGACTG A
 
Protein sequence
MLLLTRRQTI FAAIAASVAG RTAFAQSAPA KVRIALDWTP NTNHIGIYVA KAKGFYADAG 
LDVEILPFTD TSAGTLVSNG VADFGISSEI ETLTQRAGGG DVKMVYGVVQ TETARLIFKG
GRDDIKSPKD LDGKTYGGFG GTWESALISA MIRNDGGKGD VKTVTLGTSA YEALDNGSID
FTLEIYTWEG IAAELENRKI GRFHYSDYGI PDEQTTVIVS SDAYLSASRD HARAFIQATR
KGYAYSVDHP DEACDLLISG SNGALMNTEL VKASQKALIE GHFLKSEAGV IGKLDPAKAE
ALGGFLIENG ILVDANGAAL KEKPDFSTYY TNELLD