Gene Rleg2_4470 details


Gene Information

Locus tag: Rleg2_4470
Symbol:
ID: 6977564
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 103576
End bp: 104889
Gene Length: 1314 bp
Protein Length: 437 aa
Translation table: 11
GC content: 61%
IMG OID: 643393648
Product: nitrate ABC transporter, substrate-binding protein
Protein accession: YP_002278466
Protein GI: 209546548
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
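
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon (the listed gene sequence does end in TGA). The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(103576, 104889)
print(length, protein_length(length))  # 1314 437
```

Both values match the Gene Length and Protein Length fields of this record.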


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.214365
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGA TTTCGACGAC TGGGATTTCG ACAACCGGCA TTACCCGCCG CAGCATGCTC 
AAGATGACCG CGACGGCTGC CCTCATCGGC GCCGTCAAGA CTGCCTTTCC GTCCGGCGCT
TTTGCCGCCG GGACAGGCCC CGAAGTGAAA GGCGTCAAGC TCGGCTTTAT CGCTCTCACG
GATTCCGCGC CGCTGATCAT CGCTAAAGAA AAGGGCTTTT TCGATAAGCA CGGCCTTCCG
GAAACGGATG TGGCCAAACA GGCCTCCTGG GGCGCGACCC GCGACAATCT GGTGCTGGGC
GGCGCAGCAA ACGGCATCGA CGGTGCCCAT ATCCTCTCGC CGCTGCCCTA TCTCATGCAT
ACCGGCAAGG TGACGCAGAA CAACAAGCCA GTGCCGATGG CGATCCTCGC CAGGCTCAAC
CTCGACAGCC AGGGCATTTC CGTCGCCAAG GAATATGCCG AAACCGGCGT GCAGCTCGAC
GCCTCCAAGC TCAAGGCCGC CTTCGAGAAA AAGAAGGCGG AAGGCAAGGA GGTTAAGGCC
GCCATGACCT TCCCGGGCGG CACCCACGAT CTCTGGATCC GTTACTGGCT CGCCGCCGGC
GGCATCGACC CGAACAAGGA CGTTTCGACC ATCGTCGTGC CGCCGCCGCA GATGGTTGCC
AATATGAAGG TGGGCAACAT GGACGTCTTC TGTGTGGGCG AACCGTGGAA TGAGCAGCTC
GTCAACCAGG GCATCGGCTT CACCGCGGCG ACCACCGGCG AGCTCTGGAA GGGGCATCCC
GAAAAGGCGC TGGGGCTGCG TGCCGAATGG ATCGAAAAGA ATCCCAATGC TGCCAAGGCA
CTGCTGATGG CTGTCATGGA GGCTCAGCAA TGGTGCGAGA GCATGGATAA CAAGGCGGAG
ATGGCCGATA TCCTCGGTAA GCGCCAATGG TTCAACGTTC CGAGCAAGGA CGTGCTCGGC
CGCCTCAAGG GCGACATCAA TTATGGCAAC GGCCGCGAGG CCAAGGCCAC TGACCTCTAC
ATGAAGTTCT GGAAAGACGG CGCCTCCTAT CCGTTCAAGA GCCACGACAC CTGGTTCATG
ACGGAAAACA TCCGCTGGGG AAATCTGCCG GCAACGACCG ACGTCAAGGC GCTGGTCAAT
CAGGTGAACC GTGAGGACGT CTGGCGCGAG GCCGCCAAGG ATCTCGGCGT TGCTGCAGCC
GACATCCCCG CATCGTCCTC CCGCGGCAAG GAGACCTTCT TCGACGGCAA GGTTTTCGAT
CCCGAAAATC CCTCCGCCTA TCTCGACAGC CTTTCGATCA AGGCCGTCTC CTGA
 
Protein sequence
MKKISTTGIS TTGITRRSML KMTATAALIG AVKTAFPSGA FAAGTGPEVK GVKLGFIALT 
DSAPLIIAKE KGFFDKHGLP ETDVAKQASW GATRDNLVLG GAANGIDGAH ILSPLPYLMH
TGKVTQNNKP VPMAILARLN LDSQGISVAK EYAETGVQLD ASKLKAAFEK KKAEGKEVKA
AMTFPGGTHD LWIRYWLAAG GIDPNKDVST IVVPPPQMVA NMKVGNMDVF CVGEPWNEQL
VNQGIGFTAA TTGELWKGHP EKALGLRAEW IEKNPNAAKA LLMAVMEAQQ WCESMDNKAE
MADILGKRQW FNVPSKDVLG RLKGDINYGN GREAKATDLY MKFWKDGASY PFKSHDTWFM
TENIRWGNLP ATTDVKALVN QVNREDVWRE AAKDLGVAAA DIPASSSRGK ETFFDGKVFD
PENPSAYLDS LSIKAVS
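
The protein sequence can be re-derived from the gene sequence, and the GC content recomputed, with a few lines of code. The following is a minimal sketch rather than the method used to build this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and it assumes the sequence is supplied 5' to 3' on the coding strand. The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon -> amino acid map in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
prefix = "ATGAAAAAGA TTTCGACGAC TGGGATTTCG"
print(translate(prefix))  # MKKISTTGIS
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison against the Protein sequence block and the GC content field.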