Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A3216 |
Symbol | |
ID | 3836662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 3708101 |
End bp | 3711400 |
Gene Length | 3300 bp |
Protein Length | 1099 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637827331 |
Product | hypothetical protein |
Protein accession | YP_428298 |
Protein GI | 83594546 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGATT CTACCCAGAC CTCCCAGCCC CTTGCCGGCA CCGATACTCC CCCCCAGGAT CCACCGGGCG GGACGCCGAT CGTCGAGTCC CCCAAGCCCC CCGCCAAGTC GGCGGACGAG GCGCCTGGAG GCCCACGGGG CGGGACGCCC GGAGACTTAA GGGGCGGAAC GTCCGGAGAC CCAAAGGGCG GAACGGCGGC GGAGCGGTCG GCCCACCCCC CCCTGCCCCT GGGCATTCTG CCAACCGCCG GCGGGACCGA CGACGGCGAC GGGACCGCCG AGAACACCGA GCGGTCTTTC ACCTTGCCGG CGACAACGGA GCGCGGCGAG CGCGCCGGCC GTGGCGAGCG GGGGGCGCGC CGCCCGCGCA GCGAACGCCT GCGGCGCAAA CACCGCCTGG GCAGCACGCT GATCTGGGTG GCCGGGCTGG GCACCGTCTT CTGGCTTGGT GTCTTTGCCG TTTACGTCGG GGCGACCGTC GGCATCGAAA ACATCTTGCT GTACCAGCCG CTGGAAATCG GCGGCATGGT CTCGGGCGCG CTGGTGCCGA TCCTGCTGCT GTGGCTGGCC GTCGCCTTTC ACGACCGCGC CGCCAAGTTC GGCGAGGAAG CGGAAATCCT GCGCCACTAC CTCGAGCAGC TTGCCTATCC CGATGACGCG GCGGTCGATC GCGTCGGCCA GATCACCCAG GCCCTGGTCG CCCAGGCGAC GGCCCTGAAC GAGGCCTCGC GCACCGCCGT CGCCCGCGGA TTGGCGATGC GCGACGATCT GCGCCGCGAG ACCGAGGCCA TGGAAAGCAC GGCCGCCCGC ATCGCCGCCG ACGCCACCCA GCTCTCCAGC GGCCTCGACC GCCGGATCGA GGGGCTGACG CTCGCCTATA CCAAGGCCGG CGACCAGGGC CGCGAGTTGG AAACCCTGCT CGACCGTTTG CAGGGGCAGT TCACCCAGAC CGTGCGCAAG ACCGGCGACG AGGCCCGCGC CTTTCGCGAC GCCCTGACCG ACGACCTTGT GCATCTCGAC ACCCTGATCC AGCGCTCGGC CGAACAGACG CGGATGCTGG CCGGCCTTGG CCAGTCGATC GAACGCATGC TTGACGACTC CGAAGGGGTC GCCGGATCGA TCGAACGCCG GGCCGACACC TTGAAGATCC TCTATGGCGA TCAGAACCGG GCTCTGGCCC GGGCCAGCGA ACAATTGTCC GATGAGGCGG CGCGGATCGC CGAAACCCTC GGCAAGCAGT CGGCGACTTT GGCCCAGGTC ACCGAAAGCA TGGTCAGCCG GGTGCGGCTG GTCGACGAGA CGCTGACCTT ACAGGGCCGC AATCTCGCCG AAACCAGCGA CGCGGCGCTC GGCCGCCTCA AGGCGGTCGA TGGCCTGCTT AGCAAACGCA CCGAGGAATT GACCGGAATC GTCGAAGAGG TTCTAGGCCG GCTCGACGAA ACGACCGACG CCTTCACCCA GCGTTCGCGC GATCTGGCCA CCGCCGGCGA AGAGGCCTCG CGCGGCATGG ACGGCGCGGC CGAAACCGCC GGCAACGCCT TGAAGGCCAT GGGGCTGGCC ATGGGCAAGG TCCAGGAAAA GAGCAAGGGC GTGGCCGAGT TGGTGATCGG CCATGCCGAA ACCCTCGACC GTCTGGCCGG TCAGACCGCC ACCCAGACCG AGTCCATCCG CAGCGGCCTC AAGGGCCAGA CCGACGACCT GATGGGCGTG CTGACCTCGG TCCGCAGCCA TATCGATCTG GCCTCGGCGG CGATGGGCAA ACAGGCCCGC GACCTCAACG CCACCGCCGA AGGCGTGGTC AGCGCCCTGA AAGACGTGTC CGGCTTGGTC CATACCAGCG GCGGCGAATT GGCGCAGACG GCGACCCGGG TGACCGTCGA TCTTGAAGCC GCCGCCAGCA CCCTGCGCCG CAACGCCACG GAACTGGGAC AGGCCGGCAA GGGCACGGTC GACAGCCTGC GCAACGCCGG CGTCACCCTG GTCGAACAGG CCAGTCTGGT CAAGGACGCG GCCGCCATCG CCGGCAAGGC GATCGCCGAG GCCGATGGCG CCATGCGCGG CCGCGCCTCG GCGGTGTCCG AAGCCGGGCT AACCGTCGAA AAGACGCTGA GCGTCGCCGC CGAGAAGTTC AACATCCAGG CCGGGGCGCT CGATCGCGTC CTTGCCTCCT CGCGCCAGGG ATTGGAAACC GCCCTCTCCG ACCTCGGACG CCAAACCTCC GAGATGGGAA AATCGGCCGA AACGGCGGCG CGGCGCATCG TCGCCCTGTC CGAGGCCATG GGCCGGGCCG GTGGCGATTT CGACGAGCGC GCCGCCCGGG GCGTCGCCCT GGTCACCCAG GCCGCCGACC GCCTGGGCGA GGTGGTTCAG GAAGTCTCGA CCAACGCCGA GCGGGTGACC GGCGCGGTGC GCGCCGCCGC CTCGGAATTC CGCCGCGAGG TCGGCGATGT TTCCGACGGT TCGAAGGCCG CCCTGCGACC GATCCGCGAC TCCCTTTCGG CGCTGCGCCG GGAGACCGAA CAATTGAGCA GCGTCGCCGC CAATGCCGCC GAAAACGCCC TTGGTCCCTA TCGCTCGGCC CTGGCCTCGC TGCGCGGCGA AACCGAACAA CTCGGCCTGT TGGGCGGCAA TGCGGTCGAG GTCATCGTCA CCCCCTTCCG TGAGGCGCTC GCCACCCTGC GCAGCGATAC CGACGCCCTG GCCAATGACG GCAAGGCGGC GGCCGAAGGC GCCATCGCCC CCTTCCGCGA GGCGCTGGCC GGCCTGCGCC GCGATACGGA AACCCTGACC TCGGCCGGAC GGGTTCTCGC CGAGGCCACC CACAAAACCT CGGGCGCCTT CGTCAAGCAA ACCGAAGGGC TGATCGCCGC CTCCCACGAA GCCGAAAAAC GGCTGCGCGA GATGAAATCC CTCGAGGACG ATCTCGATAT CGAAAGCTTC CTCAACTCCT CGACCTATGT CATCGAGAAG CTCGATTCCC TGGCCGTGGA CATCACCCGG CTGTTCGCTC CGGCGCGCGA GGAAGACCTG TGGCGGCGCT ATCACAAGGG CGACCAGGGG GTTTTCCTGC GCCACCTCGC CCGCGCCATC ACCCCCGCCC ATGCCGAAGC CATCCGCCTT GCCACCACCA AGGACAAAAG CTTCCGCGAC TACGTGTCGT CCTATGTCAG CGAATATGAA TCCCTTTTGG AAACCACCCG CAAATCGCCG CGCGCCGACG TGCTGACCGC CCTGTTCATC GGCTCTGATC TCGGCAAGGT CTATATGGTG CTGGCCAAGG CGCTGGGCCG GCTGGAATAA
|
Protein sequence | MSDSTQTSQP LAGTDTPPQD PPGGTPIVES PKPPAKSADE APGGPRGGTP GDLRGGTSGD PKGGTAAERS AHPPLPLGIL PTAGGTDDGD GTAENTERSF TLPATTERGE RAGRGERGAR RPRSERLRRK HRLGSTLIWV AGLGTVFWLG VFAVYVGATV GIENILLYQP LEIGGMVSGA LVPILLLWLA VAFHDRAAKF GEEAEILRHY LEQLAYPDDA AVDRVGQITQ ALVAQATALN EASRTAVARG LAMRDDLRRE TEAMESTAAR IAADATQLSS GLDRRIEGLT LAYTKAGDQG RELETLLDRL QGQFTQTVRK TGDEARAFRD ALTDDLVHLD TLIQRSAEQT RMLAGLGQSI ERMLDDSEGV AGSIERRADT LKILYGDQNR ALARASEQLS DEAARIAETL GKQSATLAQV TESMVSRVRL VDETLTLQGR NLAETSDAAL GRLKAVDGLL SKRTEELTGI VEEVLGRLDE TTDAFTQRSR DLATAGEEAS RGMDGAAETA GNALKAMGLA MGKVQEKSKG VAELVIGHAE TLDRLAGQTA TQTESIRSGL KGQTDDLMGV LTSVRSHIDL ASAAMGKQAR DLNATAEGVV SALKDVSGLV HTSGGELAQT ATRVTVDLEA AASTLRRNAT ELGQAGKGTV DSLRNAGVTL VEQASLVKDA AAIAGKAIAE ADGAMRGRAS AVSEAGLTVE KTLSVAAEKF NIQAGALDRV LASSRQGLET ALSDLGRQTS EMGKSAETAA RRIVALSEAM GRAGGDFDER AARGVALVTQ AADRLGEVVQ EVSTNAERVT GAVRAAASEF RREVGDVSDG SKAALRPIRD SLSALRRETE QLSSVAANAA ENALGPYRSA LASLRGETEQ LGLLGGNAVE VIVTPFREAL ATLRSDTDAL ANDGKAAAEG AIAPFREALA GLRRDTETLT SAGRVLAEAT HKTSGAFVKQ TEGLIAASHE AEKRLREMKS LEDDLDIESF LNSSTYVIEK LDSLAVDITR LFAPAREEDL WRRYHKGDQG VFLRHLARAI TPAHAEAIRL ATTKDKSFRD YVSSYVSEYE SLLETTRKSP RADVLTALFI GSDLGKVYMV LAKALGRLE
|
| |