Gene Gura_1237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_1237 
Symbol 
ID5165729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp1462966 
End bp1464852 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content48% 
IMG OID640548741 
ProductYD repeat-containing protein 
Protein accessionYP_001230014 
Protein GI148263308 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAACCGGC CAACGCACAA ACATTACCCG GCAGATTCGG GTGAAACAGA CGTAGTCTTT 
ACTTACGACG AAACGACCTC AATCAATCCG TTTGGCAGGC TTACCTCCAT GACAGACGGT
TCCGGGAAGA CCACCTACAA TTATGACCTC TCCGGCAGGG TGACAAGAAC CGTCAAAACG
GTAGACAACG TCGACTATAC CATTGTGAAA TCCTATGACG GCATGGGAAG GTTGAACAAC
ATCACTTACC CGGATGGCGA ATCCGTTGAT TACCATTACG ACGCGGCTGG CAACCTGTTC
GACATCAGCG GCTATGCGGA ATACGACGAG TACAACGCCC TCGGCCAGCC AGGCAAACTC
TACTATGGCA ACGGCATCGT CACCAGTTAC TGGTACTATC CGCAAAATTA CCGCCTTTTT
GCCATGAAGG CCGATCAGCT GCAGAATAGT CTGCTTTACC GCACCTACAA CTATGACAGC
AAGGGGAATA TGATTGACCT GAATGACCTT GTCAATCCCT CCATCCCGCA TAACCTTACT
TATGGCTCGG TCGGCTATAC CCTTGATCCG ACCCGTGCCC ATGCCGTTCA GACCGCCTCG
ACCGCACCCG GTAGGGTCTA CCAGTACGAC GACAACGGCA ATATGACATC CGACGGCCAG
AGGACGGTAA CGTATACCCC GGACAACTTG CCGAAAACTG CTACCATGAA CGGTGTCACA
ACGACGTTCA TTTATGACGG CAACGGAAAA AGGGTGAAAA AGTTAAACTC GAATACAAGG
ATGTACATCG GTAAATTGTA CGAATGCATC AATGGTAACT GCGGCAATTA CATCTTTGCC
GGAAATACCA GAATTGCGAG GAACTTCCAG GGTTCGACCA TATATTACCA CCCTGACCAC
CTTGGAAGTA CGGCGATTAC GACCGATGAC TACGGCAACA AGGTGGAGGA CATCTACTAC
TACCCTTACG GCGAAACAAG GCTGGACAGC AAGCCAATAG GTGGCATGAC TCATAAGTAC
ACCGCCCAGG AGCTGGATTA TGAAACAGGA CTCTATAACT ACGGTGCAAG ACTTTATGAC
CCCGATATTG GGCGGTTTAT TTCTCCCGAT AGTATCGTGC CTAGTCCGGG GAACCCGCAG
AGTCTGAACA GGTATGCTTA TGCGCTGAAT AATCCGGTCA AATACAGAGA CCCTAGCGGC
CATTCGCCCA GGTCTTGCGC ATTGGGGTTC TGTAGCACTG TTGTTAATAC AGCGGCAGTT
TATGCAGCTC CGACCGGCGT TGGTGGTGCC ATTGTTAAAG GTACTGGAAT TGGCTTGTCT
TTATTATCAA TGGGGAATGC GTATTATGAG CACAAAACTG GAACTATGTC GGACTTTGAT
TACAAAGCAA CTATGGGGCT GGAATCACTC AATCTTCTTA GTGTTGGAAG CGCTGCAGCT
GCGGCAAAGA TAGGAAGAAA CATTGAAACA GTTGTTGAAA CTGCAGAAGT TGCGGACAGG
TCAATCGCTA CAATGACTCG CGCCGCTGGA GTTCCTTTAT TTACAAAAGA TGTTTTTGAG
ACTTATGGTG AACATTCTTC ATCGACTGCT AAAGGTAATG CATTCGACCA ATCCCTTTTG
GGTTCTGGTC AAATTGATGG TAGTTTGTCT TATCTGTTAT CTCAATCGCC TATGCCCGAT
TTACCTTCAA TTCCTACTTA TACTTTACCG ACTTATGATT TCAACAGTTG GTATCAAAAC
TACACACCTG GCAATAATTC GACTTCTGGT AATACTGGCT CTACATCCGA TAATTCTGGT
TCAACGACTG GTAATTCAGG TTCTTCGGGC GGTGGGTCTG ATGATGATGG TGGAGATGAT
GGTGATGCTG GAGGTGATTG GTGGTGA
 
Protein sequence
MNRPTHKHYP ADSGETDVVF TYDETTSINP FGRLTSMTDG SGKTTYNYDL SGRVTRTVKT 
VDNVDYTIVK SYDGMGRLNN ITYPDGESVD YHYDAAGNLF DISGYAEYDE YNALGQPGKL
YYGNGIVTSY WYYPQNYRLF AMKADQLQNS LLYRTYNYDS KGNMIDLNDL VNPSIPHNLT
YGSVGYTLDP TRAHAVQTAS TAPGRVYQYD DNGNMTSDGQ RTVTYTPDNL PKTATMNGVT
TTFIYDGNGK RVKKLNSNTR MYIGKLYECI NGNCGNYIFA GNTRIARNFQ GSTIYYHPDH
LGSTAITTDD YGNKVEDIYY YPYGETRLDS KPIGGMTHKY TAQELDYETG LYNYGARLYD
PDIGRFISPD SIVPSPGNPQ SLNRYAYALN NPVKYRDPSG HSPRSCALGF CSTVVNTAAV
YAAPTGVGGA IVKGTGIGLS LLSMGNAYYE HKTGTMSDFD YKATMGLESL NLLSVGSAAA
AAKIGRNIET VVETAEVADR SIATMTRAAG VPLFTKDVFE TYGEHSSSTA KGNAFDQSLL
GSGQIDGSLS YLLSQSPMPD LPSIPTYTLP TYDFNSWYQN YTPGNNSTSG NTGSTSDNSG
STTGNSGSSG GGSDDDGGDD GDAGGDWW