Gene Gura_2022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_2022 
Symbol 
ID5166151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp2357088 
End bp2359139 
Gene Length2052 bp 
Protein Length683 aa 
Translation table11 
GC content61% 
IMG OID640549516 
Productfibronectin, type III domain-containing protein 
Protein accessionYP_001230785 
Protein GI148264079 
COG category[R] General function prediction only 
COG ID[COG3401] Fibronectin type 3 domain-containing protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00638663 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGAAC AGATTAAAAG AACGTTCGCC GGCATCACCG TACTCTGTCT TGTTGCCGTC 
GGTCTTATGC TGACCGGCTG CGGCAGCAGC GGCGGAGGTC TTTCATCCCA GGTCGTGAGT
GGCGTTGCTG CGGTCGGCGC TCCACTTGCC GGGCAGGTAA ACCTTAAGGA CGCCTCTAAC
CCTCCACAGG AAAAGTCGAC CGTTATCGGT AATGACGGTA CGTTTGCCTT CGATGTCACG
GGCATGAAAG GCCCGTTCAT CCTGCAGGCG TCAGGGCGCG CCAACGGAAC GAATTACGCG
CTTCATTCCT TTGCCGGCGG AACGGGCACA GCCAACGTCA ACCCGCTCTC AAATGCGGCG
GTAGCCAGTG CTGCAGGGGT TGACGACCCG TCACAGGTCT TCGCGAATCC CGACCCGGTC
ACGCTCCAGA AAATCGAGTC CAATCTCCAG ACAGCTGTCG CCACTATCCT GTCCAAGCTC
CACCCGCTCC TGAAGCAGTA CAGCGCAGAC AACTCCGATC CGATCAAGGG GCACTACACC
GTTGACCACA CCGGTCTCGA CGGCATGCTG GACAATGTGA AAATGACCCT ATCCAACGGG
GTTCTTACGA TAGTGAATGC GAAAACCGGC GCAGTCATTT TCAGCGGCAA GATCTCCGAC
ATCAATAACT GGAATTTTTC GGACGACGAT AATAACATTC CCGCCCCGCC TGCCGTGCCC
GCCGCTCCAG CCGGCTTGAC CGCCACCGGC GCTGCCGGCC AGATGACCCT TTCCTGGAAC
GCCGTCAGCA ACGCGACCTC GTACAATGTC TACTACTCGA CCACTGGCGG TGTCTCTGCC
GCCAATGGGA CAAAGATCGC CGGGGCCACC AGTCCTTATG TCCAGAGCGG CCTTACCGCA
GGAACCACCT ATTACTACAT CGTTACGGCA GTGAACAGCG CCGGCGAAAG CGCTGCCTCG
GCCCAGGTTT CGGCGACCAC CAATGCGACG CCGACACCGA CGCCGACTCT CCCTGCTGCA
CCGACCGGAG TAATGGCCAC AGGCGGCACC AACCAGGTGA CCCTCTCCTG GAGCGCCGTC
AGCAATGCTG CCTCGTACAA CATTTACTGG TCGACCAAGA CAGGCGTCAC GACGAGCAAC
GGGACAAAGA TCAGCGGTGC CATGAGTCCT GCGGTTCAGG CGGGGCTTGC TGCCGGCACG
ACCTATTACT ACATCGTTAC GGCAGTGAAC AGCGCAGGCG AGAGCACGCC TTCCGTCCAG
GTTGCGGCGA CCACCGTCAC TCCGACTCCC GCTCCGACCG TGCCGGCTGC CCCGTCCGGC
GTGACCGCCA CCGGCGGCGC CAAGCAGGTG ACGCTGTCCT GGCCGGCAGT ATCCGGCGCA
ACCTCCTATA ATGTCTACTG GTCTACCGCT TCCGGCGTAA CGACCGCGAA CGGCACGAGA
ATCGCCGGGG CCACCAGCCC TTATGTTCAT ACCGGTCTTT CCGCAGGGAC CAGCTACTAT
TACATAGTCA CGGCTGTAAA CGGCGCGGGC GAGAGTGCTC CATCAACTCA GGCGACCGCA
ACCACCAATG CCCCACTTCC GGCCGTTCCT GCTGCACCGA CCGGTGTGAC CGCCACAGGC
GGCGCCAATC AGGTGTCTCT CTCCTGGTCG GCGGTCTCCG GCGCGACATC GTATAACGTT
TACTGGTCTA CGACTTCAGG GGTTACGACC GCTTCCGGGA CAAAAATCGC CGGGGCCACC
AGTCCCTACG TCCAGACCGG GCTTGCCGCC GGCACCGCCT ACTACTACAT CGTAACGGCG
GTGAACAGCG CCGGTGAGAG CGCCGCGTCG GCAAAGACCA CAGCGACTAC CGCCGCCCCT
GCAATCGACG GTGCAGCGCT TTATTCACAG TACTGCGCCG GGTGTCACGG AGCCCTGGCA
TCGTCCAACA AGAGGAAAAC GACCGCTTCC AAGATCCAGT CGGGGATCAG CGGCAATGTC
GGCGGAATGG GATATCTTTC CTCCCTCTCG GCGGCACAAA TTCAGGCCAT TGCTACGGCT
TTGAATTTCT AG
 
Protein sequence
MREQIKRTFA GITVLCLVAV GLMLTGCGSS GGGLSSQVVS GVAAVGAPLA GQVNLKDASN 
PPQEKSTVIG NDGTFAFDVT GMKGPFILQA SGRANGTNYA LHSFAGGTGT ANVNPLSNAA
VASAAGVDDP SQVFANPDPV TLQKIESNLQ TAVATILSKL HPLLKQYSAD NSDPIKGHYT
VDHTGLDGML DNVKMTLSNG VLTIVNAKTG AVIFSGKISD INNWNFSDDD NNIPAPPAVP
AAPAGLTATG AAGQMTLSWN AVSNATSYNV YYSTTGGVSA ANGTKIAGAT SPYVQSGLTA
GTTYYYIVTA VNSAGESAAS AQVSATTNAT PTPTPTLPAA PTGVMATGGT NQVTLSWSAV
SNAASYNIYW STKTGVTTSN GTKISGAMSP AVQAGLAAGT TYYYIVTAVN SAGESTPSVQ
VAATTVTPTP APTVPAAPSG VTATGGAKQV TLSWPAVSGA TSYNVYWSTA SGVTTANGTR
IAGATSPYVH TGLSAGTSYY YIVTAVNGAG ESAPSTQATA TTNAPLPAVP AAPTGVTATG
GANQVSLSWS AVSGATSYNV YWSTTSGVTT ASGTKIAGAT SPYVQTGLAA GTAYYYIVTA
VNSAGESAAS AKTTATTAAP AIDGAALYSQ YCAGCHGALA SSNKRKTTAS KIQSGISGNV
GGMGYLSSLS AAQIQAIATA LNF