Gene Gura_0689 details


Gene Information

Locus tag: Gura_0689
Symbol:
ID: 5164595
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter uraniireducens Rf4
Kingdom: Bacteria
Replicon accession: NC_009483
Strand:
Start bp: 822257
End bp: 824155
Gene Length: 1899 bp
Protein Length: 632 aa
Translation table: 11
GC content: 49%
IMG OID: 640548191
Product: TPR repeat-containing protein
Protein accession: YP_001229474
Protein GI: 148262768
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID:
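
The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Amino acid count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(822257, 824155)
print(length)                  # 1899
print(protein_length(length))  # 632
```

Both results match the Gene Length and Protein Length values listed in the record.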


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000734837
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAT ATTCGACCCT TCTTATGCTG GCTGTTCTGC CGCTTCTCAC CTGTGGGTTT 
GATTGGGGTT TCGGCTTAAA GGACAATTGC CGCGAAGCAA AAAAGATAGC TGTCAGCTTG
GCTGATATCA AGGTCGATGC CGCACGGAGC GAAGCGGAAT CACGAATAAT AAAGCTTTGC
CCTGATGGAG CTGCGGGGCA TTTTATGAAG GCGCTCGATC TGGAGCGAAA TGGAAACCTC
GACGGGGCCG TCACGGAGTA TCAGGAATCC TTGAAAGATG ACCCGGATTT CGGACCGGCC
AGTGGCAATC TCGGCCTCAT CTATTTACAG AAGGGTTTGC AGGATGATGC CGCAGTCGAG
CTGACCAAAG CGATAAAAAC CACCCCTTCT CCCGCCTATA ACAAGGCTCT CGGCAAGATA
TTCAGTGACA AGAAACTCTA CAATCTGGCT TTGTATCATT ACAACGAGGC TATAAACGCC
CTGCCGGCAG ACACATCTCT TTATGCTGAC CTTGCCGGAG TATATACCGG CATGGGACTC
ATAAACAGCG CCGAAGAGGA ATACAACAAG GTGCTGGCAA TCGAGCCGGG CAATCTCAAC
GCCAGACTAG GCCTGGCGGC GCTCTTCAGC AGCAGGAACC AGTTTGACAA GGCGATCGGT
GAACTCAAAA AGGCCCAGGT GATCGATCCC GGGAATAAAA ACATCCATCG CCTGTTGGCG
GAAGCCTACG ACAAAAAGGG TGATAAAAAA AGCTCCGAGT ATGAATATAT TCTTGCCGGT
ATACCGGTCA AAACCGAAGA AGTGGCTCAA AGTAACCACC TGCGTCAGGG CGATGAATTC
GTAAAAAACA AGGAATACGA AAAAGCCGCG ACTGAATACA AGGCCGCATT GAAGGACAAG
CCTGAATGGC CTGAAGCCCT GCAAAAACTT GGGGATGCGC AGATGGCGGC AGGCCATGAT
GATGAAGCAA TCGCCAGCTA TCGTGAAGCA ATCCGCCTCA AGGCGGAAAA CGGCAACCTT
CACTACAATC TGGGAATTCT ATACGAACGC AAGGCGCTCC TCGATGAAGC AGTTGTCGAA
TACCGGCAGG CCCTCAACTA TACCGCAGAC AACGGCGACA CACGAAGACG GCTTGCAGAT
ATCTACACCC TGCGCGGCAG TTTTCCGCAA GCAATCGAGC AGTACCGCGA ACTGATCAAA
CTGAGAAAAG ACAACCCTCT CATCCATTTT AAACTGGCAA AGGTCTACGT TAACAGCAAG
GATTATCCGG CGGCAATTTC AGAATACCTT GAAACGACAA AACTGGACCC CGACAATATT
GAAGCGCACC GCGATCTGGC TGCCCTTTTC AGGAAGAAAA ATCAGAACGA AGAGGCGGAA
AAGGAATACC GCTCGATACT CCGCATGAAA AAAGACGATG TCGAAGCCCG TACGGCACTC
ACCTCCATCT ACGTAAAAAA CAAGAACTAT GACGAACTGA TCAACCTCCT CAAAGAAGGG
GTGGAACTGA ATCCTAAAGA CCCCAACAGC CATTATAAGC TGGGGCTGAT ATATGAATTC
AAAAAAGATT ATGATGCTGC CATCAGCCAG TATAAGGAAT CTGTAGCATT AAAGAGCGAT
CATGCAAAAG CCTTGAATGC CATGGGACGT GCCTACATGA AGAGCGGCCG CATATCCGAA
GCAAAAGAGG CGCTGGAAAC GGCAAAGAAA GCTGATCCCG AATTGGAAGA GACGACCGTC
CTGCTCAGCA ACATAAAGGA AGAGCTGTCG CCGGAACCCA AAAAGTATAA AAAGAAAGGG
CACAAGTCCA AAAAAGGGAA AGCCGTGAAA AAACGTGGTA AAAACAAGGC GGGAAAAACA
TCGAAAGCAA AAAAATCAAA AGCCAAAACA AAAAAATAA
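
The gene sequence is printed in blocks of 10 bases, 60 per line, so the whitespace has to be removed before any computation. The following is a minimal sketch of that cleanup and of a GC fraction calculation of the kind that underlies the GC content field. The helper names are hypothetical, and the example runs on the first two blocks of the sequence only, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence.
first_blocks = clean_sequence("ATGAAAAAAT ATTCGACCCT\n")
print(first_blocks)               # ATGAAAAAATATTCGACCCT
print(gc_fraction(first_blocks))  # 0.3
```

Passing the whole gene sequence through the same two functions would give the value to compare against the reported GC content.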
 
Protein sequence
MKKYSTLLML AVLPLLTCGF DWGFGLKDNC REAKKIAVSL ADIKVDAARS EAESRIIKLC 
PDGAAGHFMK ALDLERNGNL DGAVTEYQES LKDDPDFGPA SGNLGLIYLQ KGLQDDAAVE
LTKAIKTTPS PAYNKALGKI FSDKKLYNLA LYHYNEAINA LPADTSLYAD LAGVYTGMGL
INSAEEEYNK VLAIEPGNLN ARLGLAALFS SRNQFDKAIG ELKKAQVIDP GNKNIHRLLA
EAYDKKGDKK SSEYEYILAG IPVKTEEVAQ SNHLRQGDEF VKNKEYEKAA TEYKAALKDK
PEWPEALQKL GDAQMAAGHD DEAIASYREA IRLKAENGNL HYNLGILYER KALLDEAVVE
YRQALNYTAD NGDTRRRLAD IYTLRGSFPQ AIEQYRELIK LRKDNPLIHF KLAKVYVNSK
DYPAAISEYL ETTKLDPDNI EAHRDLAALF RKKNQNEEAE KEYRSILRMK KDDVEARTAL
TSIYVKNKNY DELINLLKEG VELNPKDPNS HYKLGLIYEF KKDYDAAISQ YKESVALKSD
HAKALNAMGR AYMKSGRISE AKEALETAKK ADPELEETTV LLSNIKEELS PEPKKYKKKG
HKSKKGKAVK KRGKNKAGKT SKAKKSKAKT KK
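
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below builds the codon table from the standard compact amino acid string, which translation table 11 shares with the standard code for internal codons; table 11 differs in which codons may act as starts, and that is not modelled here because this gene begins with ATG. The function and variable names are hypothetical, and the examples use only the first 30 and last 12 bases of the gene.

```python
from itertools import product

# Codons ordered by base1, base2, base3 over "TCAG"; amino acids in the same order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAAAAAATATTCGACCCTTCTTATGCTG"))  # MKKYSTLLML
print(translate("ACAAAAAAATAA"))                    # TKK
```

The two outputs agree with the first ten and last three residues of the protein sequence listed above.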