Gene Information |
Locus tag | RSP_3791 |
Symbol | |
ID | 3721529 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007494 |
Strand | - |
Start bp | 894592 |
End bp | 899766 |
Gene Length | 5175 bp |
Protein Length | 1724 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640073440 |
Product | hypothetical protein |
Protein accession | YP_355277 |
Protein GI | 77465774 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01541] phage tail tape measure protein, lambda family |
|
|
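The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and that the last codon
    is a stop codon, which is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (RSP_3791).
length_bp = gene_length(894592, 899766)
print(length_bp)                  # 5175
print(protein_length(length_bp))  # 1724
```

Both results match the Gene Length (5175 bp) and Protein Length (1724 aa) fields in the record.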
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGCAT CGAACATGGG CGCCATGCGC GCCACCCTCG GCCTCGACGT CTCGCAGTTC GAGAACCGGG CTCAGTCCGC CGGCCGGACG GCAAAGCAGA TGTCCGACGC CATGGCGCGG GCGTTCGAGG TTGCGAAGGC CTCCGCCATG GGCGGCGCCC GGAGCTTCGA GGAGCTGCGG GCCTCGATCG ATCCGACCTT CGCCGCGACG CAACGCTATG CCGCCATCCA GCGCGAGCTT GCGGGGATGG TGGAGAGCGG TGCCGCCAGC CAGCGGGCGG CGAACCTCGT CCTCGAGCAG GCCGCCGCGA AGTACATGGG GGTGGAGACG GCGGCCGAAC GGACGGCACG GGCGCAGCGG GAGACCTCTG CCGCGGCGGA TGCGGCCGCC CGCGGATATA CCTCGCTCCG GGCGCAGGTC GATCCGCTCT ATGCGGCCTC GAAGCGGTAC GAGCAGGCGC TCGAGACGCT GAATGCGGCG CAGGCGGCGG GGGTCATCGG CGATCAGGAA CGCGCGCGGA CGCTCAAGCT GCTCGACGCC CAGATGATCT CGGCGGATCG CGCGACGACG GCGGCCACTC TAGGCATCGG TCGGTTCACG CCCGCGATCA CCAATGCCTC GTTCCAGGTG CAGGACTTCG CGGTGCAGGT CGCCTCGGGT CAGTCGGCGA TGATCGCCTT CACCCAGCAG GCGCCCCAGC TTCTCGGCGC CTTCGGCTTC TCCGGGAAGC TGGCCCTGAT CGGGGCGGGC CTCGGGACGA TCCTCGCCAT CGGCGCAGCA CTCGTTCCCG TGTTCCAGCG CATGGCCGCC GGGACTGCTG GGCTGAAGGA GAAGATCGAC GATCTGACGA AGTCGGTGGA CGGCTACAAG AGCGCCTCGG AACGCGCCCA CAAGTCGGCC ATGGAACTCA CCGCCGAGTT CGGCGCCAAC GCCGCGAGCG CGCGGGAGGC CTATGCCGCC CTGCAGAGCC TCGCGCAACT CACCGCTGTT CAGGATCTTC GCAAGGCCAT GGAGGGCATC GGGGATGCCA TTCCTTCGTC CTTCGGTCGC CTCATATCGC AGCTGGATGC CACGGGCCGC GTCGGGGCGC AGGCTGCCGC CAACCTCCGC ACTCAGTTCG GGCTGACCGC GGAGGAAGGG AAGCGCCTCG CGGAAGCCAT CCACGCCGTG GGAGCGTCCT CCGGCCCCGA GGAGGCGGCA AAGCGCGCGG CCGACCTTCA TCGGACGATG GTCGACGTGT TCGGAGCGGT CGAGGCCATT CCGCCCGAGT TCCAGACCCT CGCGCGCCTC GCGGCCGAGG GCAATGTCGC GGCCCTGCAG TTGCTCGGGA CCATGAACGG TCTCACCGGC AGCATTTCCT CTGCGGCATC GGAAGCGGCG CGGCTGGCGG CAAACCTCGG CTCCGCGGCC AACAGCGCCG CGGCGGAGGC CTCGCGCCAG ATCTCGATCG TCGACGCGCA GATGGCCGCG ATCCGCGCCG GCCAGGACGA GGTGATCGCG GGCAAGCGCG CAGCCATCGA ACTTGATAGG CAGGCCTATG TGGCGGCGAA GATGGCGGCG GGCATGGATG CGGACCGCGC GGAGTCTCTC GCGTCGCAGG TCTTCGCCTC GCAGGAGATC CTCGCCGTCC GGGAAGAGGA GCTGCGGGCG ATGCAGAAGG CCCGCTCGGA GGCCGAGAAG GCCGCCAACG GAGGCGCGAA GGCGGCTGCG GCCGAGGCGA AGGCTCTCGA CAAGAGCGCC CGGAAATATC TCGAGATGAT CGACCCGATG GAGAAGTATC GCCGGAAGCA GGCCGAGCTG 
AAGAAGCTCC TCGATGCCGG CAGGATCTCG GCGGACCAGT ATCGGCGGGC CCTGGCGGAG ATCGCGGCCG AGATGGGCGA GAACAACCCC GTGTTCGAGG AGTTCCGCAG CGCCGTGGGC TCTGCCGTCG ACTGGATGCT CGACGGCTTC CGCGGCGGCT TCAAGGGCCT CCTCGACATC GCGAAGAACA CGCTGCGACA GATCATCGGC ATGTTCATGA CGAACCGGAT CACGCTCTCG CTCGGCCTGG GGGTCTCTGG CGCGGCCGCA GGTGCGGCCG GGGCTGCCGT CGCGGGCGCC GGCGGCATGG GCACGCTCAG TGCGCTCGGC GGCATCGCGA GCGGGATCAA TGCGGTGCTT GGCGGCATCG GCGGCGCGCT TTCCGCCTTC GGCACCGGCG CCTGGGGCGC GCTCTCGAAC TTTGCAACGG GCGGCCTGTC CGGCGGCCTG GCCTATATCG GCAGCTCCCT GAACTTCGCC ACCAGCGGTC TCGCGGGCTT CGCGCAGGCG GCGGGCGCCA TCCTCGGCCC CATTGCCGCG GTGGCGGCGG CTTTCTCCTT CTTCGGCTCG AAGACGAAGC TCCTCGACGC AGGCCTGCGC GTCACCGTGC GCGAGCTGAA CGCGATGGTG GAGAGCTACA GGAAGGTCGA GAAGTCCCGG TTCGGCGGGC TGTCGAAGTC GCGGCGCACG AGCTATGGCC TCGCCGACGG CGCGGTGGCG GGCCCCATCG TCAAGGCCGT GGGTCAGATG CAGGCCTCGG TCATGGATGT GGCGGACACG CTCGGCATCG GGGCCGAGGC CTTCAAGGGC TTCGCGGCCT CGGTGAAGTT CTCGACCAAG GGGCTCTCCG ACGAGGAGAT CGGGGCGAAG CTGCAGGAGA AGCTGACCGA GCTCGGCGAC AACTTCGCCG CCCGCGCCTT CGGCTATGTC GGGAAGAACG ACCAGGCGAT CCGGGACCTC GAGAAGCGGA TCGCCGAGGG GACGTCCGAT GCGGTGGTGA CCGGGCTCAA GGGATCCATC GGCGACCGGC TTCTCTCCGC CTTCTTCGGC CGGAAGCAGC AGGGCGATCT GGCCGAGCTG ATCGCGGGCA ACACGCTCGT CTCGACCCGT CCCGAGCTCG CCGCTCTGGT CAAGGAGGGC GAAAGCTTCG TCGAGGCCCT GCAGCGGCTG AGCGCGGCCA TGTCCGGGGT CAACGGCGTC ATGGACACGC TGGGGCACAG CTTCCGCGCC GTGGATATGG TGACCGCCGG CATGGCCTCG GATCTGGCCG CGCTCTTCGG CGGGCTCGAC GGGCTCGTCT CCGCCACCTC CTCCTATTAC CAGGCCTTCT ATAGCGAGGC CGAGCGGATG GAGACCGCGA CCCGGCAGGC GACCGAGGCG CTGGCCGAGA TGGGCGTGGC CCTTCCGCAG ACCCGCGCCG AATATCGCCG GCTGGTCGAG GCGCAGGATC TGACCACCGA GCGCGGTCGG GAGCTTTATG CGGCCCTCGT CGGCATGGCC GGCGTCATGG ATCAGATCCT CCCGAGCGTC GCCGGCCTCT CGGCCGGGCT GGCGGGGCTC GTGGGCACCA TCTCCACGGA TCTCGACGGC ATGATCTCCG GCGCGGCCGA GGCGCAGCGG GCGGCGGCCG CGGCGGCGAA GGGCTGGTAT CAGGTCACGG TGGCGCTGCG CGATTACATC GGCGACCTGC GCTCGGCCGC GTCCGAGCTG ATCTCGCCCG CGGTCGCGGC GGCGCAGTCG CAGGCGCGCT ACCAGACCAT GCTGGCGAGC GCGATGGCGG GGGATCAGGA GGCGGCCAAG GCCGTCTCCG 
GCGCGGCCTC GGCCTATATC GACGCCGTGC GCGGGCAGGC CCGGTCGGCG GTGGATGTGG CGCGCGCGCA GGCGCAGGTG CTCTCCGACC TGCAGCTCCT GCAGGGCGTG ACCGGCCTCG AGGGGGCGAA GGAGGATGTG CTGGCCAGCC TCTATCGGGA GCAGGTCGAT CTCCTGACCG AGGTGCGGGA CTATCTGACC GGTGGCGAGG CGCTGAAGCC CGAGCAGATT GCGGCGTTGA ATGCTCAGCT GGGCTCCCTC GAAGGCGCCA TCGCCGCGGC GAAGGAGATC TCCTACGCCG CGCTCCGCGA GCGGATCGAT GTGACCGTGG GGCTGACGGC GACGGCCGCG ATCCCGGCCG ACCTGCGCCG CATCCTGAAG AATGCCACGA GCGGCGTCGA GGTCTCGCTC GACATGGTGC TGCGGCGGAT GGATCTCTCG CTCGATCTGG TCTGGATCGC GGCGAAGGCC TCGTCCGACC ACCTCGCGCG CATCGACTTT CTGGCGAAGA CCGACGCGCT GCCCGACGAT CTGCGCGCGC TCGCCGCCGT CCGCGTGGCG CAGTCGGTGC GCAGGCTCGC GCTGGTGATG GACAGGCCCG CCTCCGATCT CGGCATGGCG GAACTCCTGA AGGCCCTCGG CGCCCAGGGC GGCCGGATCA CGCTCGGAGG CTCCTTCGCC TTCGACCCCT CGACCGGCTT CTCGACCTGG TTCGAGAGCA CCACCCGAAC CACGCTCACC GCCCCCATGG GCGCTCTGCG CACCGCGCTC GACGATCTGA GGGCCGCGAT CCTCGCCCAA GAGCGCGCGG CCGGGCAGCG CGAGCGCGGA GCGGCTCTCT CGGCCTTCGC CGGAGGCCTC GCCACCAATG CCGCCGGCGA CATCCTCGCC ACCGACAAGC AGATCATGGC GATGGCGGCC AAGGCCGGGA TCTCGACCGA TGGCAAGACC ATCGGGCAGG TGATGCGGGC CGTCGAGGGC TTCTCGCCGC TCGACGGGAT CGAGACGATC CGCCGGCTGC CGGGGAGCCT GAAGGACTAT CTCTGGGGCC TCTTCCAGCA GCGGCAGGGC CGGATCCCGC TCGATACCGC CGATTACCTG CGGCTCTACC CGGACGTGGC CGCGGATGAG TACGGCTACG ACCCGACCAT CCACTACCGC AACCACGGCC GCGAGGCGAT CCTCGCTGGC CTGCGGCCGT TCCGGCCGGA GGTGTTCGAC TGGTCGGCCA TCGGCCTCGA CGTCCCGGGC TTCGCCGCGG GCGGGCTCCA TGCGGGCGGC CTGCGCCTCG TGGGCGAGCT CGGGCCCGAG CTCGAGGCCA CCGGCCCGAG CCGCATCCAC AGTGCGGGGC GGACCGCCGA CATCCTCGGC GGCGCCGCCA TGGGCGCCTC CGAGGTGGCC GGGGCCGTGC GCGACCTGCA GGCCGAGCTC GTGGCTCTGC GGGCCGAGCT CGCAGAGATG AAGGTCTGGG CTCGCAAGGG GGCCGAGGCC TCCACCGCCA CCGCGAAGGA CCTGCGCCGG ATCGGAACGG TGGGGGTGCG GATCGACCCG ACGGAGGCCG TCTGA
|
Protein sequence | MSASNMGAMR ATLGLDVSQF ENRAQSAGRT AKQMSDAMAR AFEVAKASAM GGARSFEELR ASIDPTFAAT QRYAAIQREL AGMVESGAAS QRAANLVLEQ AAAKYMGVET AAERTARAQR ETSAAADAAA RGYTSLRAQV DPLYAASKRY EQALETLNAA QAAGVIGDQE RARTLKLLDA QMISADRATT AATLGIGRFT PAITNASFQV QDFAVQVASG QSAMIAFTQQ APQLLGAFGF SGKLALIGAG LGTILAIGAA LVPVFQRMAA GTAGLKEKID DLTKSVDGYK SASERAHKSA MELTAEFGAN AASAREAYAA LQSLAQLTAV QDLRKAMEGI GDAIPSSFGR LISQLDATGR VGAQAAANLR TQFGLTAEEG KRLAEAIHAV GASSGPEEAA KRAADLHRTM VDVFGAVEAI PPEFQTLARL AAEGNVAALQ LLGTMNGLTG SISSAASEAA RLAANLGSAA NSAAAEASRQ ISIVDAQMAA IRAGQDEVIA GKRAAIELDR QAYVAAKMAA GMDADRAESL ASQVFASQEI LAVREEELRA MQKARSEAEK AANGGAKAAA AEAKALDKSA RKYLEMIDPM EKYRRKQAEL KKLLDAGRIS ADQYRRALAE IAAEMGENNP VFEEFRSAVG SAVDWMLDGF RGGFKGLLDI AKNTLRQIIG MFMTNRITLS LGLGVSGAAA GAAGAAVAGA GGMGTLSALG GIASGINAVL GGIGGALSAF GTGAWGALSN FATGGLSGGL AYIGSSLNFA TSGLAGFAQA AGAILGPIAA VAAAFSFFGS KTKLLDAGLR VTVRELNAMV ESYRKVEKSR FGGLSKSRRT SYGLADGAVA GPIVKAVGQM QASVMDVADT LGIGAEAFKG FAASVKFSTK GLSDEEIGAK LQEKLTELGD NFAARAFGYV GKNDQAIRDL EKRIAEGTSD AVVTGLKGSI GDRLLSAFFG RKQQGDLAEL IAGNTLVSTR PELAALVKEG ESFVEALQRL SAAMSGVNGV MDTLGHSFRA VDMVTAGMAS DLAALFGGLD GLVSATSSYY QAFYSEAERM ETATRQATEA LAEMGVALPQ TRAEYRRLVE AQDLTTERGR ELYAALVGMA GVMDQILPSV AGLSAGLAGL VGTISTDLDG MISGAAEAQR AAAAAAKGWY QVTVALRDYI GDLRSAASEL ISPAVAAAQS QARYQTMLAS AMAGDQEAAK AVSGAASAYI DAVRGQARSA VDVARAQAQV LSDLQLLQGV TGLEGAKEDV LASLYREQVD LLTEVRDYLT GGEALKPEQI AALNAQLGSL EGAIAAAKEI SYAALRERID VTVGLTATAA IPADLRRILK NATSGVEVSL DMVLRRMDLS LDLVWIAAKA SSDHLARIDF LAKTDALPDD LRALAAVRVA QSVRRLALVM DRPASDLGMA ELLKALGAQG GRITLGGSFA FDPSTGFSTW FESTTRTTLT APMGALRTAL DDLRAAILAQ ERAAGQRERG AALSAFAGGL ATNAAGDILA TDKQIMAMAA KAGISTDGKT IGQVMRAVEG FSPLDGIETI RRLPGSLKDY LWGLFQQRQG RIPLDTADYL RLYPDVAADE YGYDPTIHYR NHGREAILAG LRPFRPEVFD WSAIGLDVPG FAAGGLHAGG LRLVGELGPE LEATGPSRIH SAGRTADILG GAAMGASEVA GAVRDLQAEL VALRAELAEM KVWARKGAEA STATAKDLRR IGTVGVRIDP TEAV
|
| |
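The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup and of a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first two blocks of the gene sequence, so its result is not expected to match the 71% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two display blocks of the gene sequence in the record above.
example = "ATGTCAGCAT CGAACATGGG"
seq = clean_sequence(example)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.5
```

Applied to the full gene sequence, `len(clean_sequence(...))` can be compared with the Gene Length field, and `gc_fraction` with the GC content field.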