Gene GSU1103 details

Gene Information

Locus tag: GSU1103
Symbol:
ID: 2686921
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 1188772
End bp: 1190430
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 58%
IMG OID: 637125772
Product: AMP-binding domain protein
Protein accession: NP_952156
Protein GI: 39996205
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
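
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1188772, 1190430)
print(length)                              # 1659
print(expected_protein_length_aa(length))  # 552
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.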


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACAAC CGCTTGAATT TACGGTAGGA GGATTGCTTG ACCACATTGC GGCCCGCTAT 
CCCGACAATG ACGCTCTCGT CTATGTTGAT CGCGGCCTCC GTTACTCCTA TCGCCAGTTC
AACGAAGTCT GCCGCGAGGT GGCCAAGGGG CTGCTGCGGC TCGGCGTCAA GAAGGGCGAT
CATGTTTCCA TCTGGGCCTA CAATGTCCCC GAATGGGTGA TCCTTCAGTT CGCCACGGCC
AAAATCGGCG CTGTGCTCGT AACGGTCAAC ACCAACTACA AATCGGCGGA GCTGGAGTAT
ATCCTCAACC AATCCGACTC TTCGACGCTC TTCCTTGTGA AATCGTTCAA GGATACCGAT
TATGTGGCCA CCGTGAACGA GGTCGTGCCC GAGCTGGCCG GGTCGGAAGC CGGCGCCCTT
TCCAGTCCGA AACTCCCTTT CCTCAGGAAT GTCGTCTTCA TCGGCAGCGA GACGCCGGCT
GGCATGCTCA ATTTCGAGGC AATTGCCGCC ATGGGTCAGG ATGTCTCCGA TGCCGAACTG
GCGGCTGTTG AGGCAACCCT CGATCGCCAC GACGTCATCA ATATGCAGTA CACCTCGGGC
ACCACCGGCT TTCCCAAGGG GGTCATGCTG ACCCACTTCA ACATCATCAA CAACGGCTTC
AACATCGGTG AATGCATGAA GTTCACCGAG AAGGACCGGC TCTGCATCCC GGTCCCCTTT
TTCCACTGCT TCGGCTGCGT GCTCGGCGTC ATGGCCTGCG TCACCCACGG CACGACCATG
GTGCCGGTGG AGATTTTCGA CCCCCTCTCG GTGCTGCGCA CCATCGAAAA GGAGCGCTGC
ACTGCGGTTC ACGGCGTTCC CACCATGTTT ATCGCGGAGC TGGAGCACCC GGACTTTCCG
AAGTTCGATC TCACGAGTCT CCGCACCGGG ATCATGGCCG GCTCCAACTG CCCCATCGAG
GTAATGAAAA AGGTCATCAG CCAGATGAAC GCCAGCGAAA TCACCATCGC CTACGGCCAG
ACCGAGTCGT CGCCCGTGAT CACCCAGACC CGCACCGACG ATGCCATCGA GCTCCGGGTG
GCCACGGTCG GCCGGGCGCT TCCCGACGTG GAGGTCAAGA TCGTCGATAT CGAAACCGGT
GCCGAACTGC CGCCGGGCAA GCAGGGCGAG CTGTGCACCC GCGGATACCT GGTCATGAAG
GGGTACTATA AAATGCCCGA GGAGACTGCG CGGGCAATCG ATGCCGACGG CTGGCTCCAC
ACGGGTGACC TGGCCGTCAT GGATGAGAAC GGCTACTGCA AGATCACCGG TCGGATCAAG
AATATGATCA TTCGAGGCGG TGAGAATATC TATCCTCGCG AGATAGAGGA GTTTCTCTAT
ACCCACCCGA AAATATCAGA TGTTCAGATC TATGGCGTCC CCGACCGCAA GTACGGGGAG
CAGGTCATGG CCGCGGTCAT TCTCAAGAAG GGCGACACCA TGACCGAAGA GGATGTGCGT
GATTTCTGCA GGGGCAAGAT CGCCAACTAC AAGATTCCCA AGTACGTTAA ATTCGTCGAT
TCCTACCCCA TGACCGCCTC TGGCAAGATC CAGAAGTTCA AGCTGCGGGA AATGGCCATC
AAGGAGCTCG GCCTCGAAGG GCCGGGGGAA ACCGCTTGA
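
A GC content figure such as the one listed in the record can be recomputed from the sequence text. The sketch below is illustrative only: `gc_percent` is a hypothetical helper, and it assumes the sequence is pasted as plain text in 10-base blocks separated by whitespace, as shown above. It is demonstrated on the first 20 bases rather than the full gene.

```python
def gc_percent(seq_text: str) -> float:
    """Percentage of G and C bases in a sequence, ignoring whitespace."""
    seq = "".join(seq_text.split()).upper()  # drop block spacing and line breaks
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence.
print(gc_percent("ATGGCACAAC CGCTTGAATT"))  # 45.0
```

Passing the complete gene sequence to the same function gives the value to compare with the GC content field.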
 
Protein sequence
MAQPLEFTVG GLLDHIAARY PDNDALVYVD RGLRYSYRQF NEVCREVAKG LLRLGVKKGD 
HVSIWAYNVP EWVILQFATA KIGAVLVTVN TNYKSAELEY ILNQSDSSTL FLVKSFKDTD
YVATVNEVVP ELAGSEAGAL SSPKLPFLRN VVFIGSETPA GMLNFEAIAA MGQDVSDAEL
AAVEATLDRH DVINMQYTSG TTGFPKGVML THFNIINNGF NIGECMKFTE KDRLCIPVPF
FHCFGCVLGV MACVTHGTTM VPVEIFDPLS VLRTIEKERC TAVHGVPTMF IAELEHPDFP
KFDLTSLRTG IMAGSNCPIE VMKKVISQMN ASEITIAYGQ TESSPVITQT RTDDAIELRV
ATVGRALPDV EVKIVDIETG AELPPGKQGE LCTRGYLVMK GYYKMPEETA RAIDADGWLH
TGDLAVMDEN GYCKITGRIK NMIIRGGENI YPREIEEFLY THPKISDVQI YGVPDRKYGE
QVMAAVILKK GDTMTEEDVR DFCRGKIANY KIPKYVKFVD SYPMTASGKI QKFKLREMAI
KELGLEGPGE TA
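
The protein sequence can be checked against the gene sequence by translating it with translation table 11. The following is a minimal sketch, not the database's own procedure: the `translate` helper is hypothetical, and it assumes a forward-strand reading frame starting at the first base. Table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Amino acids for codons in the conventional NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate a DNA sequence up to (not including) the first stop codon."""
    seq = "".join(seq_text.split()).upper()  # drop block spacing and line breaks
    residues = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence.
print(translate("ATGGCACAAC CGCTTGAATT TACGGTAGGA"))  # MAQPLEFTVG
print(translate("ACCGCTTGA"))                         # TA
```

The two fragments reproduce the first ten and last two residues of the protein sequence listed above.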