Gene GSU3401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU3401 
Symbol 
ID2688178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3743342 
End bp3744532 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content61% 
IMG OID637128096 
Productbranched-chain amino acid ABC transporter, periplasmic amino acid-binding protein, putative 
Protein accessionNP_954441 
Protein GI39998490 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.232513 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTCA GGAAAATTGC ATCGCTGCTG ACCGTTGCCG CAGTAGCCTC GGCCTTTGCG 
GTGGGCTTTG GCTGCAAGAA GAAGGAAGAA GCTCCGGCCG GCGCCCCGGC TGCCGCCGGC
GACACCATCA AGATCGGCTT CCTCGGCGCC CTGACCGGCG ACGTGGCCAT GTTCGGCAAG
CCGACCCTCG ACGGCATGAA GATGGCCGCT GAAGAACTCA ACGCAGCAGG CGGCGTTCTC
GGCAAGAAGA TCGAGATCGT CGAGGCCGAT AACCGTGGCG ACAAGCAGGA AGGTGCCTCG
GTCACCCAAA AGCTCATCAG CCGTGACGGC GTAGTTGCCA TCGTTGGCGA CCCCACCACC
GGCATTACCA AGGTTGCCGC TCCCATTGCC CAGAAGGCCC AGGTGGTACT CCTCTCCGCC
GGCGCCACGG GCCCCGGCGT CGTGGAAAAC GGCGATTTCA TCTTCCGCAA CACCCTGCTG
GACAGCGTCG CCATCCCGGC CTGCATCGAC TTCTTCGCCA AGGACCTGGG CTACAAGAAG
GTTGCGATCA TCACCTCCGA CAACAACGAT TACAGCGTGG GCCTCTCCCA GACCTTCCGC
GACGCCGCCA AGGGCAAGGG TGTTGAAATC GTTGCCGACG AGAAGGTAAA GGACGGCGAC
AAGGACTTCA GCGCCCAGAT CACCAACATC AAGGGCAAGA AGCCCGATGT TATCTTCTTC
TCCGGCTACT ACACCGAGGG TGCCCTCATC ATGAAGGAAG CCCGCAAGCA GGGCCTCAAG
GCCAAGATGT TCGGTGGCGA CGGCCTCTTC TCGCCGAAGC TGATCGAGCT GGGCGGCGAT
GCGGTTGAGG GCACCATGTC CGCCCTCGGC TTCTCTCCCG AGCAGGCTTC TCCGGTTACC
GCCAAGTTCG TCGAGGCGTA CAAGAAGAAG TTCAACGGCG TCGAACCGGG TCTCTTCGAC
GCTCAGGGAT ATGACGGCGT CATGATGCTG GCCGATGCCA TGAAGCGTGC CAACAGCGCA
GATCCGAAGG TGTTCAAGAC CGCCCTCGGC CAGACCAAGA ACTATGAAGG GGTCTCGGGG
ACCATCACCA TCCGCGAGAA CCGCGAGCCG ATCAAGTCGC CGCTGGCCCT TCTGGAAGTA
AAGGGTGGGA AGTTCGCCCT GAAGGCCAAG GTTCCGGTCA AGATGGACTA A
 
Protein sequence
MKFRKIASLL TVAAVASAFA VGFGCKKKEE APAGAPAAAG DTIKIGFLGA LTGDVAMFGK 
PTLDGMKMAA EELNAAGGVL GKKIEIVEAD NRGDKQEGAS VTQKLISRDG VVAIVGDPTT
GITKVAAPIA QKAQVVLLSA GATGPGVVEN GDFIFRNTLL DSVAIPACID FFAKDLGYKK
VAIITSDNND YSVGLSQTFR DAAKGKGVEI VADEKVKDGD KDFSAQITNI KGKKPDVIFF
SGYYTEGALI MKEARKQGLK AKMFGGDGLF SPKLIELGGD AVEGTMSALG FSPEQASPVT
AKFVEAYKKK FNGVEPGLFD AQGYDGVMML ADAMKRANSA DPKVFKTALG QTKNYEGVSG
TITIRENREP IKSPLALLEV KGGKFALKAK VPVKMD