Gene Information |
Locus tag | Gura_1237 |
Symbol | |
ID | 5165729 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | - |
Start bp | 1462966 |
End bp | 1464852 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640548741 |
Product | YD repeat-containing protein |
Protein accession | YP_001230014 |
Protein GI | 148263308 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
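The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that Gene Length counts the terminal stop codon (the listed gene sequence ends in TGA). The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 1462966
END_BP = 1464852
GENE_LENGTH_BP = 1887
PROTEIN_LENGTH_AA = 628


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


if __name__ == "__main__":
    # Both comparisons check the record's internal consistency.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1464852 - 1462966 + 1 gives 1887 bp, and 1887 / 3 - 1 gives 628 aa, matching the Gene Length and Protein Length fields.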
| |
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAACCGGC CAACGCACAA ACATTACCCG GCAGATTCGG GTGAAACAGA CGTAGTCTTT ACTTACGACG AAACGACCTC AATCAATCCG TTTGGCAGGC TTACCTCCAT GACAGACGGT TCCGGGAAGA CCACCTACAA TTATGACCTC TCCGGCAGGG TGACAAGAAC CGTCAAAACG GTAGACAACG TCGACTATAC CATTGTGAAA TCCTATGACG GCATGGGAAG GTTGAACAAC ATCACTTACC CGGATGGCGA ATCCGTTGAT TACCATTACG ACGCGGCTGG CAACCTGTTC GACATCAGCG GCTATGCGGA ATACGACGAG TACAACGCCC TCGGCCAGCC AGGCAAACTC TACTATGGCA ACGGCATCGT CACCAGTTAC TGGTACTATC CGCAAAATTA CCGCCTTTTT GCCATGAAGG CCGATCAGCT GCAGAATAGT CTGCTTTACC GCACCTACAA CTATGACAGC AAGGGGAATA TGATTGACCT GAATGACCTT GTCAATCCCT CCATCCCGCA TAACCTTACT TATGGCTCGG TCGGCTATAC CCTTGATCCG ACCCGTGCCC ATGCCGTTCA GACCGCCTCG ACCGCACCCG GTAGGGTCTA CCAGTACGAC GACAACGGCA ATATGACATC CGACGGCCAG AGGACGGTAA CGTATACCCC GGACAACTTG CCGAAAACTG CTACCATGAA CGGTGTCACA ACGACGTTCA TTTATGACGG CAACGGAAAA AGGGTGAAAA AGTTAAACTC GAATACAAGG ATGTACATCG GTAAATTGTA CGAATGCATC AATGGTAACT GCGGCAATTA CATCTTTGCC GGAAATACCA GAATTGCGAG GAACTTCCAG GGTTCGACCA TATATTACCA CCCTGACCAC CTTGGAAGTA CGGCGATTAC GACCGATGAC TACGGCAACA AGGTGGAGGA CATCTACTAC TACCCTTACG GCGAAACAAG GCTGGACAGC AAGCCAATAG GTGGCATGAC TCATAAGTAC ACCGCCCAGG AGCTGGATTA TGAAACAGGA CTCTATAACT ACGGTGCAAG ACTTTATGAC CCCGATATTG GGCGGTTTAT TTCTCCCGAT AGTATCGTGC CTAGTCCGGG GAACCCGCAG AGTCTGAACA GGTATGCTTA TGCGCTGAAT AATCCGGTCA AATACAGAGA CCCTAGCGGC CATTCGCCCA GGTCTTGCGC ATTGGGGTTC TGTAGCACTG TTGTTAATAC AGCGGCAGTT TATGCAGCTC CGACCGGCGT TGGTGGTGCC ATTGTTAAAG GTACTGGAAT TGGCTTGTCT TTATTATCAA TGGGGAATGC GTATTATGAG CACAAAACTG GAACTATGTC GGACTTTGAT TACAAAGCAA CTATGGGGCT GGAATCACTC AATCTTCTTA GTGTTGGAAG CGCTGCAGCT GCGGCAAAGA TAGGAAGAAA CATTGAAACA GTTGTTGAAA CTGCAGAAGT TGCGGACAGG TCAATCGCTA CAATGACTCG CGCCGCTGGA GTTCCTTTAT TTACAAAAGA TGTTTTTGAG ACTTATGGTG AACATTCTTC ATCGACTGCT AAAGGTAATG CATTCGACCA ATCCCTTTTG GGTTCTGGTC AAATTGATGG TAGTTTGTCT TATCTGTTAT CTCAATCGCC TATGCCCGAT TTACCTTCAA TTCCTACTTA TACTTTACCG ACTTATGATT TCAACAGTTG GTATCAAAAC TACACACCTG GCAATAATTC GACTTCTGGT AATACTGGCT CTACATCCGA TAATTCTGGT 
TCAACGACTG GTAATTCAGG TTCTTCGGGC GGTGGGTCTG ATGATGATGG TGGAGATGAT GGTGATGCTG GAGGTGATTG GTGGTGA
|
Protein sequence | MNRPTHKHYP ADSGETDVVF TYDETTSINP FGRLTSMTDG SGKTTYNYDL SGRVTRTVKT VDNVDYTIVK SYDGMGRLNN ITYPDGESVD YHYDAAGNLF DISGYAEYDE YNALGQPGKL YYGNGIVTSY WYYPQNYRLF AMKADQLQNS LLYRTYNYDS KGNMIDLNDL VNPSIPHNLT YGSVGYTLDP TRAHAVQTAS TAPGRVYQYD DNGNMTSDGQ RTVTYTPDNL PKTATMNGVT TTFIYDGNGK RVKKLNSNTR MYIGKLYECI NGNCGNYIFA GNTRIARNFQ GSTIYYHPDH LGSTAITTDD YGNKVEDIYY YPYGETRLDS KPIGGMTHKY TAQELDYETG LYNYGARLYD PDIGRFISPD SIVPSPGNPQ SLNRYAYALN NPVKYRDPSG HSPRSCALGF CSTVVNTAAV YAAPTGVGGA IVKGTGIGLS LLSMGNAYYE HKTGTMSDFD YKATMGLESL NLLSVGSAAA AAKIGRNIET VVETAEVADR SIATMTRAAG VPLFTKDVFE TYGEHSSSTA KGNAFDQSLL GSGQIDGSLS YLLSQSPMPD LPSIPTYTLP TYDFNSWYQN YTPGNNSTSG NTGSTSDNSG STTGNSGSSG GGSDDDGGDD GDAGGDWW