Gene Afer_1835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAfer_1835 
Symbol 
ID8323929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidimicrobium ferrooxidans DSM 10331 
KingdomBacteria 
Replicon accessionNC_013124 
Strand
Start bp1920888 
End bp1922156 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content62% 
IMG OID644952966 
Productcitrate synthase I 
Protein accessionYP_003110422 
Protein GI256372598 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID[TIGR01798] citrate synthase I (hexameric type) 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGAGT CGATCACCAT CACGGACAAC CGCACCGGCG TGTCCGTCGA GATCCCCATC 
GTGGACGGTA CGGTTTCGGC GGCCGATTGG AGCAAGGCGC TGCCGGGCAT CTGGTTCCTC
GATCCCGCAC TGGTATCGAC CGCGATGGCC GAGAGCGCGA TCACGTACCT CGATGGCGAG
GCGGGGATTC TGCGCTATCG GGGCTATCCG ATCGAGCAGC TCGCCGAACG CTCGACCTAT
CTGGAGGTGG CGTACCTGCT GCTCCACGGC GAATTGCCGA CCGCGAGCGA GCTCGAGACG
TGGGTCGACG AGATCACGCA CCACACCTTC ATCCACGAGA ACGTTCGCAA GCGCTTCCTC
GAGGGATTCC ACTACGACGC CCACCCCATG GGCATCTTGG TCTCGGCGGT CGCCGCGCTC
TCGACGTTCT ACCTGGACGC CAAGGACATC TTCGATCCCG ATGCCAGACA TCGACAGATC
ATTCGTCTGA TCGCCAAGAT GCCAACCCTC GCTGCGGCTG CCTATCGATT CTCGCAAGGG
ATGCCGTTCG TCTATCCGGA CAACTCGCTG TCGTTCCCGG CGAACTTCCT GTCGATGATG
TGGAAGATCG CCGAGCCTCG CTACGAGGCA GATCCGCGTC TCATTCGAGC GATCGACGTG
CTGTTCATCC TCCACGCCGA CCACGAGCAG AACTGCTCGA CCACGGCGAT GCGCGTCACG
GGCTCAGCGC ACTCTGACCC CTACTCATCC GCGGCAGCGG CCTGCGCGGC GCTCTATGGT
CCGCGGCATG GCGGCGCCAA CGAGGCCGTC GTACGCATGC TGACGGAGAT CGGCTCGATC
GACAATGTGC CCGCGTTCGT CGAATCCATC AAGCGTGGCC ATGGCATCCT CCAGGGCTTC
GGGCATCGGG TGTACAAGAA TTACGATCCT CGCGCGCGCA TCATCAAGGA AGTCGCCTAC
GACGTGTTCG AGGTCACGGG GAAGAACCCA CTCCTCGATA TCGCCCTGAA GCTCGAGGAG
GTCGCGCTCT CCGATGAGTA CTTCATCTCG CGCAAGCTCT ATCCGAACGT GGACTTCTAC
TCGGGCCTGA TCTACCAAGC CATGGGTTTC CCCGTGGAGA TGTTCCCGGT GCTGTTTGCG
ATCCCTCGTA TGTCGGGGTG GCTTGCACAC TGGAACGAAC TGCTCGATCA AGACGCCAAG
ATCGTGCGCC CTCGCCAGCG CTACATCGGC GCTCCGAAGC GCGACTACGT TCCGGTCGAG
CAGCGCTGA
 
Protein sequence
MPESITITDN RTGVSVEIPI VDGTVSAADW SKALPGIWFL DPALVSTAMA ESAITYLDGE 
AGILRYRGYP IEQLAERSTY LEVAYLLLHG ELPTASELET WVDEITHHTF IHENVRKRFL
EGFHYDAHPM GILVSAVAAL STFYLDAKDI FDPDARHRQI IRLIAKMPTL AAAAYRFSQG
MPFVYPDNSL SFPANFLSMM WKIAEPRYEA DPRLIRAIDV LFILHADHEQ NCSTTAMRVT
GSAHSDPYSS AAAACAALYG PRHGGANEAV VRMLTEIGSI DNVPAFVESI KRGHGILQGF
GHRVYKNYDP RARIIKEVAY DVFEVTGKNP LLDIALKLEE VALSDEYFIS RKLYPNVDFY
SGLIYQAMGF PVEMFPVLFA IPRMSGWLAH WNELLDQDAK IVRPRQRYIG APKRDYVPVE
QR