Gene Information |
Locus tag | Rsph17029_3368 |
Symbol | |
ID | 4898624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | + |
Start bp | 416620 |
End bp | 421794 |
Gene Length | 5175 bp |
Protein Length | 1724 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640113967 |
Product | hypothetical protein |
Protein accession | YP_001045236 |
Protein GI | 126464123 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01541] phage tail tape measure protein, lambda family |
|
|
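The coordinate, gene length, and protein length fields above can be cross-checked with a few lines of arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 416620
end_bp = 421794

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the result matches the reported 5175 bp and 1724 aa.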
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCATGT CCAATGTCGG CGCCATGCGC GCCACCCTCG GCCTCGACGT CTCTCAGTTC GAGAACCGGG CCCAGTCCGC CAGCCGGACC GCAAAGCAGA TGTCCGACGC CATGGCCCGG GCGTTCCAGG TTGCGAAGGC CTCTGCCATG GGCGGCGCCC GGAGCTTCGA GGAGTTGCGG GCCTCGATCG ATCCGACCTT CGCCGCGACG CAACGCTATG CCGTCATCCA GCGCGAGCTC GCAGGCATGG TGGAGAGCGG CGCCGCCAGC CAGCGCGCGG CGAACCTCGT CCTCGAGCAG GCGGCGGCGA AGTACATGGG GGTGGAGACG GCGGCCGAGC GGACGGCACG GGCGCAGCGA GAGACATCTG CCGCGGCGGA CGCGGCCGCC CGCGGCTATA CCGCGCTCCG GGCGCAGGTC GATCCGCTCT ATGCGGCCTC GAAGCGGTAC GAGCAGGCGC TCGAGACGCT GAATGCGGCG CAGGCGGCGG GGGTCATCGG CGATCAGGAA CGCGCGCGGA CGCTCAAGCT GCTCGACGCC CAGATGATCT CGGCGGATCG TGCGACGGCG GCGGCCACAC AGGGCATCGG CCGGTTCACG CCCGCGATCA CCAATGCCTC GTTTCAGGTG CAGGACTTCG CGGTGCAGGT CGCCTCGGGT CAGTCGGCGA TGATCGCCTT CACCCAGCAG GCGCCCCAGC TTCTTGGCGC CTTCGGATTC TCCGGGAAGC TGGCCCTGAT CGGGGCGGGC CTCGGGACGA TCCTCGCGAT CGGCGCCGCC CTTGTGCCCG TGTTCCAGCG CATGGCCGCC GGGACCGCGG GACTGAAGGA AAAGATCGAC GATCTGACGA AGTCGGTGGA CGGCTACAAG AGCGCCTCGG GACGCGCCCA CAAGTCGGCC ATGGAACTCA CCGCCGAGTT CGGCGCCAAC GCCGCGAGCG CGCGGGAGGC CTACACCGCT CTGCAGAGCC TGGCGCAACT CACCGCCGTG CAGGATCTTC GCAAGGCCAT GGAGGGCATC GGGGACGCCA TTCCCTCGTC CTTCGGTCGC CTTGTATCGC AGCTGGATGC CACGGGCCGC GTCGGGGCGC AGGCCGCCGC CAACCTCCGC ACTCAGTTCG GGCTGACCGC GGAGGAAGGG AAGCGTCTCG CCGAGGCCAT CCACGCCGTG GGAGCCTCCT CCGGCCCCGA GGAGGCGGCA AAGCGCGCGG CCGAGCTTCA TCGGACGATG GTCGACGTGT TCGGAGCGGT GGAATCCATC CCGCCCGAGT TCCAGACTCT CGCGCGCCTC GCGGCCGAGG GCAATGTCGC GGCTCTGCAG TTGCTCGGGA CCATGAACGG CCTGACCGGC AGCATCTCCT CTGCGGCATC GGAAGCGGCG CGGCTGGCTT CAAACCTCGG CTCCGCGGCC AACAGCGCCG CGGCGGAGGC CTCGCGCCAG ATCTCGATCA TCGATGCGCA GATGGCCGCG ATCCGCGCCG GTCAGGACGA GGTGATCGCG GGCAAGCGCG CAGCCATCGA TCTGGATCGG CAGGCCTATG TGGCGGCGAA GATGGCGGCG GGCATGGATG CGGATCGCGC CGAGTCTCTC GCGTCGCAGG TCTTCGCCTC GCAGGAGATC CTCGCCGTCC GGGAAGAGGA GCTGCGGGCG ATGCAGAAAG CCCGCTCGGA GGCCGAGAAG GTCGCCAACG GAGGCGCCAA GGCGGCTGCG GCGGAGGCGA AGGCGCTGGA CAAGAGCGCC CGGAAATACC TCGAGATGAT CGACCCGATG GAGAAGTATC GCCGGAAACA GGCCGAGCTG 
AAGAAGCTCC TCGATGCCGG CAGGATCTCG GCGGACCAGT ATCGGCGGGC ACTGGCAGAG ATCGCGGCCG AGATGGGCGA GAATAACCCC GTGTTCGAGG AGTTCCGCAG CGCCGTCGGG AGCGCGGTCG ACTGGATGCT CGACGGCTTC CGCGGCGGCT TCGACGGCCT CCTCGACATC GCGAAGACCA CGCTGAAGCA GATCATCGGC ATGTTCATGA CGAACCGGAT CACGCTCTCG CTCGGCCTCG GGGTCTCGGG CGCGGCCGCA GGCGCTGCCG GGGCAGCCGT TGCGGGCGCG GGCGGCATGG GCACTCTCGG CGCGCTCGGC GGCATCGCGA GCGGGATCAA TGCGGTGCTC GGCGGCATCG GCGGCGCGCT TTCCGCCTTC GGCACCGGCG CCTGGGGCGC GCTCTCGAAC TTTGCAACGG GCGGCCTGTC CGGCGGCCTG GCCTATATCG GCAGCTCCCT GAACTTTGCC ACGAGCGGTC TGGTCGGGTT CGCGCAGGCG GCGGGCGCGA TCCTCGGCCC CATTGCCGCC GTGGCGGCGG CCTTCTCCTT CTTCGGCTCG AAGACGAAGC TCCTCGATGC AGGCCTGCGC GTCACGGTGC GCGAGCTGAA TGCGATGGTG GAGAGCTACC GGAAGGTCGA GAAGTCCCGG TTCGGCGGGC TCTCGAAGTC GCGGCGCACG AGCTACGGCC TCGCGGACGG TGCGGTGGCC GGCCCCATCG TCAAGGCCGT GAGCCAGATG CAGGCCTCGG TCATGGATGT GGCGGACACG CTCGGCATCG GGGCCGAGGC CTTCAAGGGC TTTGCGGCCT CCGTGAAGTT CTCGACCAAG GGGCTCTCCG ACGAGGAGAT CGGGGCGAAG CTGCAGGAGA AGCTCACCGA GCTCGGCGAC AACTTCGCCG CCCGCGCCTT CGGCTATGTC GGAAAGAACG ACCAGGCGAT CCGGGACCTC GAGAAGCGGA TCGCGGAGGG GACGTCCGAT GCGGTGGTGA GCGGGCTCAA GGGATCCCTC GGCGACAAGA TTCTCTCCGC CTTCCTCGGC CGAAAGATGC AGGGCGACCT GGCCGACCTG ATCGCGGGCA ACACGCTCGT CTCGACCCGC CCCGAGCTCG CCGCTCTGGT CAAGGAGGGC GAGAGCTTCG TCGAGGCCCT GCAGCGGCTG AGCGCGGCCA TGTCCGGGGT CAACGGCGTG ATGGACACGC TCGGCCACAG CTTCCGGGCG GTGGACATGG TGACAGCCGG CATGGCCTCG GATCTGGCGG CGCTCTTCGG CGGGCTGGAG GGATTGGTCT CCGCCACCAC CGGCTATTAT CAGGCCTTCT ATAACGAGGC CGAGCGGATG GAGACCGCGA CCCGGCAGGC GACCGAGGCG CTGGCGAAGC TGGGTGTGGC CCTTCCCGAG ACCCGGGCGG AGTATCGCCG GCTGGTCGAG GCGCAGGATC TCACCACCGA GCGGGGTCGA GAGCTCTACG CCGCCCTCGT CGGCATGGCG GGCGTCATGG ATCAGATCCT GCCGAGCGTG GCCGGCCTCT CGGCCGGGCT GGCGGGGCTC GTGGGCACGA TCACCACCGA TCTCGACGGG ATGATCTCTG GGGCGGCCGA AGCGCAGCGG GCGGCGGCCG CGGCAGCGAA GGGCTGGTAT CAGGTCACGC TGTCTCTGCG CGATTATATC GGCGACCTGC GCTCTGCCGC CTCCGAGCTG ATCGGCCCCG CGGTGGCGGC GGCGCAGTCG CAGGCGCGCT ACCAGACGAT GCTGGCGAGC GCGATGGCAG GCGATCAGGA GGCGGCCAAG GCCGTCTCCG 
GCGCGGCCTC GGCCTATATC GACGCCGTGC GCGGGCAGGC CCGGTCGGCG GTGGATGTGG CCCGCGCGCA GGCGCAGGTG CTCTCGGACC TGCAGCTCCT GCAGGGCGTG ACCGGCCTCG AGGGGGCGAA GGAGGATGTG CTGGCCAGCC TCTATCGGGA GCAGGTCGAT CTCCTGACCG AGGTGCGGGA CTATCTGGGC CAGGGCGGCC TCCTCGACCC CGCCCGGATC GACGCGCTCA ACGGCCAGCT CGGATCGCTC GAAGGCGCCA TCGCCGCGGC GAAGGAGATC TCCTACGCCG CGCTCCGCGA GCGGATCGAC GTGACCTTGG GGCTGACGGG GACGGCCCAG ATCCCGGCCG ATCTGCGCCG CATCCTGAAG AATGCCACGA GCGGCGTCGA GGTCTCGCTC GACATGGTGC TGCGGCGGAT GGATCTGACG CCCGATCTGG TCTGGATCGC GGCGAAGGCC TCCTCCGACC ACCTCGCGCG GATCCGCTAT CTGGCGAAGA CCGACGCGCT GCCGGACGAT CTGCGCGCCA TCGCCGCCGT CCGCGTGGCG CAGTCGGTGC GCCGGCTCGC GCTGGTGATG GACCGGCCCA GCTCCGATCT CGGCATGGCG GAGCTCCTGA AGGCCCTTGG CGCCCAGGGC GGCCGGATCA CCCTCGGCGG CAGCTTCGCC TTCGATCCCT CGACCGGCTT CTCGAGCTGG TTCGAGACCA CGACGCGGGG GGCGATCACG GCGCCGATGA CGGCGCTGCG CACCGCGCTC GGCGATCTGG CGGCCGCCGT GCGGGCAGAA ACGGCCGCCG CGACGAAGAG GGGACAGGGG GCGGCGCTCT CGGCCTTCGC TGGGGGCCTC GCGACCAATG CGGCCGGCGA CATCCTCGCC ACGGACAAGC AGATCATGGC GATGGCCGCC AAGGCCGGGA TCTCGACCGA CGGCAAGACC ATCGGGCAGG TGATGCGGGC CATCGAGGGC TTCTCGCCGC TCGACGGGAT CGAGACGATC CGCCGGCTGC CGGGGAGCCT GAAGGACTAC CTCTGGGGCC TCTTCCAGCA GCGGCAGGGC CGGATCCCGC TCGATACCGC CGATTACCTG CGGCTCTATC CGGACGTGGC GGCGGACGAA TATGGCTACG ACCCGACCAT CCATTACCGC AACCACGGCC GCGAGGCGAT CCTCGCGGGC CTGCGGCCGT TCCGGCCGGA AGTGTTCGAC TGGTCGGCCA TCGGCCTCGA CGTCCCGGGC TTCGCCGCGG GCGGGCTCCA TGGCGGCGGC CTGCGCCTCG TGGGCGAGCT CGGGCCCGAG CTCGAGGCCA CCGGCCCGAG CCGGATCCAC AGCGCGGGGC GGACCGCCGA CATCCTCGGC GGCGCCGCCA TGGGCGCCTC CGAGGTGGCC GGCGCCGTGC GCGATCTGCA GGCCGAGCTC GTGGCTCTGC GGGCCGAGCT CGCAGAGATG AAGGTCTGGG CCCGCAAGGG GGCCGAGGCC TCCACCGCCA CCGCGAAGGA CCTGCGCCGG ATCGGCACGG TGGGGGTGCG GATCGACCCG ACGGAGGCCG TCTGA
|
Protein sequence | MSMSNVGAMR ATLGLDVSQF ENRAQSASRT AKQMSDAMAR AFQVAKASAM GGARSFEELR ASIDPTFAAT QRYAVIQREL AGMVESGAAS QRAANLVLEQ AAAKYMGVET AAERTARAQR ETSAAADAAA RGYTALRAQV DPLYAASKRY EQALETLNAA QAAGVIGDQE RARTLKLLDA QMISADRATA AATQGIGRFT PAITNASFQV QDFAVQVASG QSAMIAFTQQ APQLLGAFGF SGKLALIGAG LGTILAIGAA LVPVFQRMAA GTAGLKEKID DLTKSVDGYK SASGRAHKSA MELTAEFGAN AASAREAYTA LQSLAQLTAV QDLRKAMEGI GDAIPSSFGR LVSQLDATGR VGAQAAANLR TQFGLTAEEG KRLAEAIHAV GASSGPEEAA KRAAELHRTM VDVFGAVESI PPEFQTLARL AAEGNVAALQ LLGTMNGLTG SISSAASEAA RLASNLGSAA NSAAAEASRQ ISIIDAQMAA IRAGQDEVIA GKRAAIDLDR QAYVAAKMAA GMDADRAESL ASQVFASQEI LAVREEELRA MQKARSEAEK VANGGAKAAA AEAKALDKSA RKYLEMIDPM EKYRRKQAEL KKLLDAGRIS ADQYRRALAE IAAEMGENNP VFEEFRSAVG SAVDWMLDGF RGGFDGLLDI AKTTLKQIIG MFMTNRITLS LGLGVSGAAA GAAGAAVAGA GGMGTLGALG GIASGINAVL GGIGGALSAF GTGAWGALSN FATGGLSGGL AYIGSSLNFA TSGLVGFAQA AGAILGPIAA VAAAFSFFGS KTKLLDAGLR VTVRELNAMV ESYRKVEKSR FGGLSKSRRT SYGLADGAVA GPIVKAVSQM QASVMDVADT LGIGAEAFKG FAASVKFSTK GLSDEEIGAK LQEKLTELGD NFAARAFGYV GKNDQAIRDL EKRIAEGTSD AVVSGLKGSL GDKILSAFLG RKMQGDLADL IAGNTLVSTR PELAALVKEG ESFVEALQRL SAAMSGVNGV MDTLGHSFRA VDMVTAGMAS DLAALFGGLE GLVSATTGYY QAFYNEAERM ETATRQATEA LAKLGVALPE TRAEYRRLVE AQDLTTERGR ELYAALVGMA GVMDQILPSV AGLSAGLAGL VGTITTDLDG MISGAAEAQR AAAAAAKGWY QVTLSLRDYI GDLRSAASEL IGPAVAAAQS QARYQTMLAS AMAGDQEAAK AVSGAASAYI DAVRGQARSA VDVARAQAQV LSDLQLLQGV TGLEGAKEDV LASLYREQVD LLTEVRDYLG QGGLLDPARI DALNGQLGSL EGAIAAAKEI SYAALRERID VTLGLTGTAQ IPADLRRILK NATSGVEVSL DMVLRRMDLT PDLVWIAAKA SSDHLARIRY LAKTDALPDD LRAIAAVRVA QSVRRLALVM DRPSSDLGMA ELLKALGAQG GRITLGGSFA FDPSTGFSSW FETTTRGAIT APMTALRTAL GDLAAAVRAE TAAATKRGQG AALSAFAGGL ATNAAGDILA TDKQIMAMAA KAGISTDGKT IGQVMRAIEG FSPLDGIETI RRLPGSLKDY LWGLFQQRQG RIPLDTADYL RLYPDVAADE YGYDPTIHYR NHGREAILAG LRPFRPEVFD WSAIGLDVPG FAAGGLHGGG LRLVGELGPE LEATGPSRIH SAGRTADILG GAAMGASEVA GAVRDLQAEL VALRAELAEM KVWARKGAEA STATAKDLRR IGTVGVRIDP TEAV
|
| |
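The GC content field can be approximated directly from the gene sequence. The following is a minimal sketch, assuming GC content is simply the fraction of G and C bases over the full gene length; how the database itself computes and rounds the value is not stated in the record, and `gc_fraction` is a hypothetical helper name.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    # The record prints the sequence in space-separated 10-base blocks.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First three 10-base blocks of the gene sequence, used as a small demo.
prefix = "ATGTCCATGT CCAATGTCGG CGCCATGCGC"
print(f"{gc_fraction(prefix):.0%}")
```

The demo covers only the first 30 bases (18 of which are G or C). Passing the complete gene sequence to the same function gives a value that can be compared with the reported 70%.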