Gene Rleg_4663 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4663 
Symbol 
ID8007141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp26952 
End bp27968 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content58% 
IMG OID644821599 
ProductNMT1/THI5 like domain protein 
Protein accessionYP_002972859 
Protein GI241113024 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.679325 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGAAAG CACAGACGAA GATGCAATTC CTGGGCTGCC TCGGCCTCGC CTCGATGCTT 
GCGGCTGCCT CACCGGCGCT TGCCCTCGAC AAGGTGAGCT ACGGGACGAA CTGGCTTGCC
CAGGCGGAGC ATGGCGGCTT CTACCAAGCC GTCGCCGACG GCACCTATGC AAAATACGGC
CTCGACGTCA CCATTGTCCA GGGCGGCCCG AATGCTGCAA ACAGCGCCCT ATTGATCTCC
GGCAAGCTCG ATTTCTACAT GGGCGGCCCT CAGGGAGAGA TATCCGCCGT CGAACAGGGC
ATTCCGCTGG TCGATGTCGC CGCGATCTTC CAAAAGGATC CGCAGGTACT GATCGCCCAT
CCGGACAACG GCGTCGACAA GTTCGAGGAC CTCGCCAAGC TGAAAACGCT GTTCCTCAGC
AAGGACGGCT ATCTCACCTA TTTCGAGTGG ATGAAGGCCA ACTTCAAAGG CTTCAAGGAC
GAGCAGTACA AGCCCTATAA CTTCAGTCCC GCCCCCTTCC TCGCAGACAA GGAGTCTGCC
CAGCAGGGGT ACCTGACCTC CGAACCCTAC GAGATCCAGA AGCAGGCAGG CTTCGAGCCA
AAGGTCTTCC TGCTCGCCGA CAACGGCTAC TCACCCTATT CGACGATGAT CACGACCACG
CAGGCGACGA TCGATGGCAA GCCCGACGTC GTGCAGCGCT TCGTCGATGC CTCGATCGAG
GGCTGGTACA ATTACCTCTA CGGCGACAAC ACCAAGGCGA ACGCGCTGAT CAAGAAGGAC
AATCCTGAAA TAACGGACGG CCAGATCGCC TATTCGGTCA CCAAGATGAA GGAATACGGC
ATCATCGAAT CCGGCGACAG CCTGGACAAG GGCATCGGCT GCATCACCGA CGCCCATTAC
AAGAAGTTCT TCGACGAGAT GACTGCTATC AAGGTCTTCA AGACCGACAC CGACTATACC
AAGGCCTTCA CGACGAAGTT CGTCTGCAAG GGCGCCGGAA TAGCGCTGAA GAAATAA
 
Protein sequence
MLKAQTKMQF LGCLGLASML AAASPALALD KVSYGTNWLA QAEHGGFYQA VADGTYAKYG 
LDVTIVQGGP NAANSALLIS GKLDFYMGGP QGEISAVEQG IPLVDVAAIF QKDPQVLIAH
PDNGVDKFED LAKLKTLFLS KDGYLTYFEW MKANFKGFKD EQYKPYNFSP APFLADKESA
QQGYLTSEPY EIQKQAGFEP KVFLLADNGY SPYSTMITTT QATIDGKPDV VQRFVDASIE
GWYNYLYGDN TKANALIKKD NPEITDGQIA YSVTKMKEYG IIESGDSLDK GIGCITDAHY
KKFFDEMTAI KVFKTDTDYT KAFTTKFVCK GAGIALKK