Gene GSU1681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1681 
Symbol 
ID2687401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1841919 
End bp1843010 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content57% 
IMG OID637126362 
Productcobyrinic acid a,c-diamide synthase family protein 
Protein accessionNP_952732 
Protein GI39996781 
COG category[R] General function prediction only 
COG ID[COG0857] BioD-like N-terminal domain of phosphotransacetylase 
TIGRFAM ID[TIGR00347] dethiobiotin synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.307852 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAAAA AAGTCTTTAT CGGCGCTACA GGACAAAACT GCGGAAAAAC AACCATGAGC 
GTCTCGCTCA TGCACTTGGC GCGACAGAAA TACCAGAGGG TCGGCTTCAT CAAGCCCATC
GGCCCCAAGA TCGAGATGTA CAACGGCCTC ACCGTCGACA TGGACGCCAT CCTCATGGCG
CGGACCTTCG GCCTGGAAGA AGACCTGGCC CTCATGAATC CCGTGCCCCT CCCCAAGAAC
TTCACCCGCG ACTACCTGAG CGGCAAGTTT GACTGCCATA CTCTCAAAAA AAAGATTGTC
GAAGCATTCG AAATACTTGA CCAGAGCTAT GACTTCCTGA TTATCGAGGG TGCCGGCCAC
TGCGGAGTCG GCTCCGTCAT CGGTCTCAGC AATGCCTGTG TAGCCCATAT GCTCGGGGCA
CCGGTAATCG TGGTGACTGA CAGCGGTATC GGCAGCGCCA TCGATGCCGT GCACCTCAAC
CTGGCCCTCT ACGAAAAGGA GGAGGCCGAC GTCCGGATGG TCATCGTCAA CAAGCTCCGC
TCAGACAAGC GGGACTCGAT CCTCGGCTTC CTCAGGCGGG GGTTCCCGGG GCGTTCACTC
CAGGTTACCG GTGGCTTCAA CTATTCTCCC GTCCTGGCAA ACCCGACCCT TTCCCATATC
GGAAAACTTC TCAATCTCCC CGTTCATGGC GACGCGGACG GGCACAGCCG GATCATCCAC
CACATTCACC TGGGAGCGGC ATCGTCCCAA CGGGTGGTGG ACGCCCTGGA GGATGCCACA
CTGCTGGTCC TCACCAGTTC ACGGGATGAG CTGATCGTCA CCCTGTCGTC TCTGTATCAT
ATCCCATCCT ATAGGGACAA GATCGCAGGC CTCGTCATTG CTGGTCACAT GCCGGTATCG
GAAATAACCC AGCAGATTCT GGACGACAGC ATGATCCCCT ACATCCGGGT TCATGACTCC
ACGGCGCGAG TGTTCACAAC GCTTATGGAG GACGTTTCCA AGATCACTGC CGAGGACCAG
GAAAAGCTCA ACTGGATCAG GGCAAATGCC GAGAACGAAA TCGATTTCGA GGCAATAGAC
GCTCTGCTCT GA
 
Protein sequence
MAKKVFIGAT GQNCGKTTMS VSLMHLARQK YQRVGFIKPI GPKIEMYNGL TVDMDAILMA 
RTFGLEEDLA LMNPVPLPKN FTRDYLSGKF DCHTLKKKIV EAFEILDQSY DFLIIEGAGH
CGVGSVIGLS NACVAHMLGA PVIVVTDSGI GSAIDAVHLN LALYEKEEAD VRMVIVNKLR
SDKRDSILGF LRRGFPGRSL QVTGGFNYSP VLANPTLSHI GKLLNLPVHG DADGHSRIIH
HIHLGAASSQ RVVDALEDAT LLVLTSSRDE LIVTLSSLYH IPSYRDKIAG LVIAGHMPVS
EITQQILDDS MIPYIRVHDS TARVFTTLME DVSKITAEDQ EKLNWIRANA ENEIDFEAID
ALL