Gene Acid345_3002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3002 
Symbol 
ID4071557 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3554248 
End bp3555654 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content61% 
IMG OID637985021 
Productpolysulphide reductase, NrfD 
Protein accessionYP_592077 
Protein GI94970029 
COG category[C] Energy production and conversion 
COG ID[COG5557] Polysulphide reductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00130542 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCATCAAG GTCCAGACGA GAAAACAGTC TTACAAGACG TCCTTGAGCC GGGCGAACCA 
CCAGTACTTG CCCCCGGACA CGACTTCGGC ACGGTTTCGG ACAAGATCTC CGCGATCGTG
CTCCGTCGCC CGCTGGGGCT GGGCTGGACG GGCGCTGCCT TCATCGGCTT CGTCTTCGTC
AACCTGCTCC TCATGGCCGT CACCTGGTTG TTCCTGCGCG GTGTCGGCAT CTGGGGCATC
AACGTTCCGG TGGCATGGGG CTTCGCGATC ATCAACTTCG TGTGGTGGGT CGGTATCGGC
CACGCGGGCA CGCTGATCTC TGCCATCCTG CTGCTCCTCA AGCAGACTTG GCGTAACTCC
ATCAACCGTT TCGCCGAAGC CATGACGCTC TTCGCCGTGG CCTGCGCCGG CATGTTCCCG
CTCTTCCACG TCGGTCGCCC TTGGTTGGCC TACTGGTTGT TCCCGTATCC CAACTCCATG
GGGGTGTGGC CGCAGTTCCG CAGCCCGCTC GTATGGGACG TCTTCGCGGT CTCTACTTAC
GCCACCGTCT CCGCGATTTT CTGGTTCGTC GGCCTCATCC CCGACATCGC CACCCTCCGC
GATAGCGCGA AAAATCCCTA CGCACGGACG ATTTACGGCT TGCTCGCCAT GGGATGGCGC
GGCTCCGCCC GGCACTGGAA GCGTTATCAG TCGGTGTACC TGCTCCTCGC CGGCCTCGCC
ACGCCCCTCG TCCTCTCCGT GCATACGGTC GTGTCCTTCG ACTTCGCCGT CGCGCAATTG
CCCGGATGGC ACACCACGAT CTTCCCGCCA TACTTCGTCG CAGGCGCCAT CTACTCCGGC
TTCGCCATGG TTCTCGTTCT GGCGATTCCG ATCCGTCACT ACTACGGCGT GAGCGACATG
ATCACCTCGC GCCACCTCGA GAATGCCGCC AAGATCATGC TCGCCACCGG CCTCATTGTG
GCTTACGGCT ACTTCATGGA AATCTTCATG GCGTTCTACG GGACGAATAT CTACGAGCGC
GCCATTGTGT GGACTCGTTG GCGCGGGCCG TACGCTCCGG GATATTGGGC GTTGATCGCC
TGTAACATTC TCATCCCGCA AGTCCTGTGG ATCCCGAAGG TTCGTAAGAG CCCGTTTTGG
CAGTTCGTCA TCTCAATGGA CATCCTCATC GGCATGTGGC TGGAACGCTT CATCATCGTC
GTCACCAGCT TGCACCGCGA CTTCCTGCCC TCCTCGTGGG GCATGTACAC CCCAACCCGT
TGGGACTGGG CGACCTACCT CGGTACGCTC GGCTTCTTCC TCTTCGCGTT CGTGTTGTTC
ATCCGCGTGC TGCCGATGAT CACTATTTTC GAAATCAAAG CGCTGCTGCC CGAATCTGAG
CCGCACGCGG TGGAGGTGAA GGCGTAA
 
Protein sequence
MHQGPDEKTV LQDVLEPGEP PVLAPGHDFG TVSDKISAIV LRRPLGLGWT GAAFIGFVFV 
NLLLMAVTWL FLRGVGIWGI NVPVAWGFAI INFVWWVGIG HAGTLISAIL LLLKQTWRNS
INRFAEAMTL FAVACAGMFP LFHVGRPWLA YWLFPYPNSM GVWPQFRSPL VWDVFAVSTY
ATVSAIFWFV GLIPDIATLR DSAKNPYART IYGLLAMGWR GSARHWKRYQ SVYLLLAGLA
TPLVLSVHTV VSFDFAVAQL PGWHTTIFPP YFVAGAIYSG FAMVLVLAIP IRHYYGVSDM
ITSRHLENAA KIMLATGLIV AYGYFMEIFM AFYGTNIYER AIVWTRWRGP YAPGYWALIA
CNILIPQVLW IPKVRKSPFW QFVISMDILI GMWLERFIIV VTSLHRDFLP SSWGMYTPTR
WDWATYLGTL GFFLFAFVLF IRVLPMITIF EIKALLPESE PHAVEVKA