Gene GSU1945 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1945 
Symbol 
ID2685513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2130067 
End bp2133210 
Gene Length3144 bp 
Protein Length1047 aa 
Translation table11 
GC content60% 
IMG OID637126636 
Productfibronectin type III domain-containing protein 
Protein accessionNP_952994 
Protein GI39997043 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.771273 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACCC GCACGCTAAC AACTCTTATT CTTGCCTGGC TGCTCAGTTT TACCATGGTG 
GCAGCCAGCC AGGCCCGGGA CGTGACCCTG CAGTGGGATG CAAACACTGA AACGACCGTT
GCCGGCTACA AGGTCTACTA CAACGCCGAC TCGGCAGGCC CTCCCTTCAG CGGGACCGGA
ACGGTCGGCA AAGTGACCTC CACCACTCTC ACCGGTCTCG ACCCGAGCAA GACCTATTAT
TTCGCCGTCA CCGCCTACGA TGCCACCGGC ACTGAAAGCA CCTACTCAAA TATCGTAAGC
GCGGCCGAAG CAACTGCCCC CACCGTTTCG ATTACATCCC CCGCCTCGGG CTCGTCCATC
TCCGGTACCA CCAGCGTTGC CATCAGCGCC ACTGACAATG TGGGCGTCAC ATCGGTTGAA
CTCTATGTGG ACGGCGTCCT CAAAGGAACC GACACCTCAT CTCCGTACAG TGTGAGCCTG
AACACCACCC AGCTTTCCGC CGGCACTCAC ACTCTCCAGG CCAAAGCGTA CGACGCCGCC
GGGAACGTGG GACAGTCCAC GGCGTTTTCC GTGACAGTGG TAAATGATAC GACGGCTCCT
ACGGCTTCCA TTACCTCTCC GACCAGCAGT TCAACCGTGT CGGGAACCGT TACGGTAAAC
GTTTCCGCCA CCGACGCCAT GGGCGTGTCA AAGGTGGAGC TCTATGTTAA TGGTTCCCTT
TACGCGACCT ACGGTTCGGC TCCCTACTCG ATAACCTGGA ACACCTCGTC GTATGCCAAC
GGGTCGTACA CCCTCCAGGC CAAGGCTTAT GACGCGGCGG GCAACGTTGG TCAGTCTTCC
TCGGTGGCAG TCACCGTAAG CAATACGGTC GCCGACACCA CGGTTCCGAC CGTTGCCGTG
AGCTCCCCCG CCAATGGCGC GACCGTGACC GGTACGGTGA GCATGGCCGC CACGGCGTCG
GACAACGTCG GTGTGAGCAA GGTCGAATTC TATGTGAACA ACGTGCTCAA GGGGTCGGAT
ACGACGTCTC CCTACAACTA CAGCTGGGAT ACCACCTCCA CGGCCAACGG CAGCTACTCC
CTGACCGCCA AGGCTTATGA CGCGGCGGGC AACGTTGGTC AGTCTTCCTC GGTTACGGTC
ACCGTAAGCA ATACGGTCGC CGATACCACG GTTCCGACCG TTGCCGTGAG CTCCCCCGCC
AATGGTGCAA CCGTGACCGG TACGGTGAGT ATGGCCGCCA CGGCGTCGGA CAACGTCGGT
GTGAGCAAGG TCGAATTCTA TGTGAACAAC GTGCTCAAAG GGTCGGATAC GACGTCTCCC
TACAACTACA GCTGGGATAC CACCTCCACG GTCAACGGCA GCTACTCCCT GACCGCCAAG
GCTTATGACG CGGCGGGCAA TGTGGGGCAG TCGAGCAGCG TAACCGTGAG CGTGAACAAT
GTGACGACTC CCCCGTCGGG AAGCAACACG GCCATCTTCG GTAATGCCTT TGGCGCCAAT
TTCCCGAATA CCGTGGAAGA CACCTTCCTG AATATCAACG ACGACGTGAA CGCCACCGGC
GTAAGCCTCA GCACCTATAC CTGGCCCGCT GCCACGCCGG CCAACGCAGT CGTGATGAAG
TGGGACGTTT CCGCACTGCC TGCCAATGCC GAGATCCAGA GCGCCACGCT CTACCTCTAC
CTGACTGAAG GCGGTGGTGA CGACGCTTAC GAAATTCCGG TCTCCGCCAT CATCAACAAG
AACCCGGTTG TCGCTTCAAG CACTGGCAAT ACCTACGACG GGACCAATGC CTGGACCGCG
AGCAGTGTGG CTTACGGCGG GGTGCCCCTG GCCCAGTCCG ATATCGACAC GCCGGTTGAT
GCACCTCTGG TTGACAAGAC CGTTGGTTAT AAAGCCTGGA ATATAACCAA TCTGGTCAAG
ACCTGGCTGG CGACGCCGGC CGCCAACAGG GGTGTTCTGC TCAACTCCTC CAACAAGGCC
GCCGTTGACA GCTACAGGCT CTTCGCTTCC AGCGAGGCGT CCGATACCAA TCTGCGGCCG
AAACTGGTGG TAACCTACAA TCTGCCGTCC GATACTGCTG CTCCGACGGT GGCGGTGAGT
GCCCCGGCTA ATGGTGCGAC CGTGAGCGGA ACGGTGACCG TCAGCGCAAC AGCCTCTGAC
AACGTGGGCG TGACCAAGGT GGAATTCATG GTGAACGGCA CGGTCGCTTC CACGGTGACA
ACCGCCCCTT ACAGCTACAG CTGGAACACC ACGACCTCGG CCAACGGCAC CTACACCCTG
ACTGCCAAGG CCTATGACGC TGCCGGCAAC ATCGGCCAAT CTACCTCCGT ATCGGTAACG
GTGAACAACC AGATCGGCGA CACCACTGCG CCGACCGTAT CGATCACTTC TCCGGCCAAT
AACGCGACCG TCAAAGGAAG CATTACCGTC AGCGCCAGTG CATCGGACAA CGTGAAGGTG
ACCAAGGTCG AGTTCTACCT GGACAACGTG CTCAAGCGCA CCGACACGAG CTCGCCCTTT
ACCTACAGCC TCAACACGAC CTCCGTAAGT GATGGCACCC ATACCCTGAC CGCCAAGGCC
TATGATGCGG CAGGCAATAT CGGCGAGACA ACCGTGACGG TGAAGGTCGC CAACGATGCA
ACCGCGCCGA CCGTGTCACT GTCGGCCCCG ACCAGCGGCG CAACGGTAAG CGGTGTCGTC
TCCGTCAATG CCACCGCAAC GGACAACCTG GCAGTTGCCA AAGTCGAATT CTACGTCAAC
AATGTCCTCG CGAGCACGGA TACGACCTCT CCCTACAGCT ACAGCTGGGA TACCTCCACC
GTTGCTAACG GCGTCTACAG CCTGACCGCC AAGGCCTATG ATGCGGCCGG CAACTCGAAG
GTTTCGACCG CCGTAACCGT AACCGTCAAT AACATCGTGA TCATCAAGGG CGATGTGGAT
GGCGACGGAG CGATTACGGC CAATGACGCG CTCATCGTTC TCAAGGCGGT CGCGGATCCC
ACCCTGTTGA CCTCGACAGT GCAGAGCATG GGTGATGTGG CGCCGGTCGA TCCGGTTACC
TCCAAGCCGG TCGGCAACGG CAAGATTGAC ATCAACGATG TACTGATTCT TCTGCGCCGG
GCGGTTGGTC TGACGACCTG GTAA
 
Protein sequence
MKTRTLTTLI LAWLLSFTMV AASQARDVTL QWDANTETTV AGYKVYYNAD SAGPPFSGTG 
TVGKVTSTTL TGLDPSKTYY FAVTAYDATG TESTYSNIVS AAEATAPTVS ITSPASGSSI
SGTTSVAISA TDNVGVTSVE LYVDGVLKGT DTSSPYSVSL NTTQLSAGTH TLQAKAYDAA
GNVGQSTAFS VTVVNDTTAP TASITSPTSS STVSGTVTVN VSATDAMGVS KVELYVNGSL
YATYGSAPYS ITWNTSSYAN GSYTLQAKAY DAAGNVGQSS SVAVTVSNTV ADTTVPTVAV
SSPANGATVT GTVSMAATAS DNVGVSKVEF YVNNVLKGSD TTSPYNYSWD TTSTANGSYS
LTAKAYDAAG NVGQSSSVTV TVSNTVADTT VPTVAVSSPA NGATVTGTVS MAATASDNVG
VSKVEFYVNN VLKGSDTTSP YNYSWDTTST VNGSYSLTAK AYDAAGNVGQ SSSVTVSVNN
VTTPPSGSNT AIFGNAFGAN FPNTVEDTFL NINDDVNATG VSLSTYTWPA ATPANAVVMK
WDVSALPANA EIQSATLYLY LTEGGGDDAY EIPVSAIINK NPVVASSTGN TYDGTNAWTA
SSVAYGGVPL AQSDIDTPVD APLVDKTVGY KAWNITNLVK TWLATPAANR GVLLNSSNKA
AVDSYRLFAS SEASDTNLRP KLVVTYNLPS DTAAPTVAVS APANGATVSG TVTVSATASD
NVGVTKVEFM VNGTVASTVT TAPYSYSWNT TTSANGTYTL TAKAYDAAGN IGQSTSVSVT
VNNQIGDTTA PTVSITSPAN NATVKGSITV SASASDNVKV TKVEFYLDNV LKRTDTSSPF
TYSLNTTSVS DGTHTLTAKA YDAAGNIGET TVTVKVANDA TAPTVSLSAP TSGATVSGVV
SVNATATDNL AVAKVEFYVN NVLASTDTTS PYSYSWDTST VANGVYSLTA KAYDAAGNSK
VSTAVTVTVN NIVIIKGDVD GDGAITANDA LIVLKAVADP TLLTSTVQSM GDVAPVDPVT
SKPVGNGKID INDVLILLRR AVGLTTW