Gene GSU1644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1644 
Symbol 
ID2687346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1802943 
End bp1804622 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content59% 
IMG OID637126325 
Productputative ABC transporter ATP-binding protein 
Protein accessionNP_952695 
Protein GI39996744 
COG category[R] General function prediction only 
COG ID[COG0488] ATPase components of ABC transporters with duplicated ATPase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACCG ACAGCACGAA AATCATCTAC ACCATGATGA GGGTGTCCAA GTACTACGAC 
AAGAAGCCCG TCATCAAGGA CATCTCCCTT TCCTATTTTT ATGGGGCAAA GATCGGTGTC
CTGGGTCTCA ACGGCTCGGG CAAGTCGACG CTGCTGCGGA TCATGGCCGG GGTCGACAAA
GACTTCAACG GTCAAGCTGT GCTTTCGCCC GGTTACACCG TCGGCTACCT TGAGCAGGAA
CCCAAGCTCG ACGAGTCGAA AACCGTCCGT GAAATCGTCG AGGAAGGGTG CCAGGAAACC
GTCAATCTCC TGAACGAATT CAACGAAATC ACCGCCGCCT TTGCCGATCC CGACGCTGAT
ATGGACAAGC TGCTTGAGCG CCAGGCAGAG GTGCAGGAAA AGCTCGATCA CCTGGATGCC
TGGGATCTGG ACTCGCGCCT GGAAATGGCA ATGGATGCCC TGCGCTGCCC CCCCGGCGAC
ACGCCGGTGA ACGTTCTGTC CGGCGGCGAG AAGCGCCGCG TGGCTCTGTG CCGCTTGCTC
CTTCAAAAGC CCGACATTCT TCTGCTGGAC GAACCAACCA ACCATCTGGA CGCCGAAACC
GTCGCCTGGC TGGAACATCA CCTGCAGAGC TACCCCGGGA CGATTATCGC CGTCACCCAC
GACCGGTACT TCCTCGACAA CGTTGCCGGC TGGATTCTTG AGCTCGATCG AGGCGAGGGA
ATACCCTGGA AGGGCAATTA TTCCTCGTGG CTTGATCAGA AGCAGAACCG CCTGGCTCAG
GAGGAAAAGG CGGAGAGTGA ACGCCAGAAG ACCCTGCAGC GCGAGCTCGA GTGGATCAGG
ATGAGCCCCA AGGGACGCCA CGCCAAAAGC AAGGCGCGCA TCAGTTCCTA CGAGCAGCTC
CTGTCCCAGG AGAGCGAGAG GCGGGCCAAG GACCTGGAGA TCTATATCCC GCCCGGTCCC
AGGCTCGGTG ATATCGTCAT CGAGGCTGAC AATGTCTCCA AGGCCTACGG GGACCGCCTG
CTGGTCGAGG GGATGAGCTT CCGTCTGCCG CCCGGCGGGA TCGTCGGTGT TATCGGCCCC
AACGGAGCGG GCAAAACGAC CCTTTTCCGG ATGATTACCG GTCAGGAGGA GCCTGATGCC
GGCACCATAC GCACCGGCGA GACGGTGAAA ATCGCCTACG TGGACCAGAG CCGCGACGCC
CTTGATCCGA ACAAGAGTAT CTGGGAAGAA ATCTCCGGCG GCCAGGAGAC GCTTCAGCTC
GGCAAGGTGA GTGTCAATTC CCGCGCCTAT GTCTCCCGCT TCAATTTCTC GGGGAGTGAT
CAGCAGAAAA AAGTGGGGAT GCTCTCCGGC GGAGAGCGGA ACCGGGTCCA TATAGCAAAG
ATGCTCAAGG AGGGAGGAAA CGTCATCCTT CTGGACGAGC CGACCAACGA TCTGGATGTG
AACACCATGC GGGCATTGGA AGAAGCTCTG GAAAACTTCG CCGGATGCGC TGTGGTAATC
TCCCACGACC GATGGTTCCT CGACCGTATC GCCACCCATA TCCTCGCCTT CGAAGGGGAC
AGCCAGGCAG TCTGGTTCGA CGGAAACTAC TCGGAATACG AAGAGGATCG CAAGAAGCGC
CTCGGTGCCG CCGCTGACAT GCCGCACCGG ATCAAGTACC GGCAGTTGAC GCGGGTATAA
 
Protein sequence
MSTDSTKIIY TMMRVSKYYD KKPVIKDISL SYFYGAKIGV LGLNGSGKST LLRIMAGVDK 
DFNGQAVLSP GYTVGYLEQE PKLDESKTVR EIVEEGCQET VNLLNEFNEI TAAFADPDAD
MDKLLERQAE VQEKLDHLDA WDLDSRLEMA MDALRCPPGD TPVNVLSGGE KRRVALCRLL
LQKPDILLLD EPTNHLDAET VAWLEHHLQS YPGTIIAVTH DRYFLDNVAG WILELDRGEG
IPWKGNYSSW LDQKQNRLAQ EEKAESERQK TLQRELEWIR MSPKGRHAKS KARISSYEQL
LSQESERRAK DLEIYIPPGP RLGDIVIEAD NVSKAYGDRL LVEGMSFRLP PGGIVGVIGP
NGAGKTTLFR MITGQEEPDA GTIRTGETVK IAYVDQSRDA LDPNKSIWEE ISGGQETLQL
GKVSVNSRAY VSRFNFSGSD QQKKVGMLSG GERNRVHIAK MLKEGGNVIL LDEPTNDLDV
NTMRALEEAL ENFAGCAVVI SHDRWFLDRI ATHILAFEGD SQAVWFDGNY SEYEEDRKKR
LGAAADMPHR IKYRQLTRV