Gene Acid345_2272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2272 
Symbol 
ID4073266 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2692272 
End bp2693648 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content60% 
IMG OID637984288 
Productcytochrome c, class I 
Protein accessionYP_591347 
Protein GI94969299 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.242682 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCTCCTC ATTTCCCCCA AAGCGTCGGC CCGTATAGTT GCGATGCCGC AACTACAGTT 
GCAGAAGTCT GCAACTGTCG CCCGGCACGT TTCGCGATTC TGCTTGACCC CTTCGTAGCG
CCGAATTACT ATCCGGCGCA GTTGGCTCCG GCCAATTCCC CCCAATTCCG CACCAGAGCG
ATTTTGATCG CTGGTGCAGG CCCATATGTA GCCAGGTTGC GCGCTTTTGC GCGCGCCCGG
CGGTCAGCGT TTTTCGTAAA TTTGACCCTC GGCCCCAAAA TTTTCAAGTT CAAGCAGGTA
GATGTGAAAG TTCGTCTCTC AGGCAGCAAC CTTTTGTGGA TGTCGTCCGC CATGCTGGTG
TGTCTTACCT TCAGCAATCC AATCGGAACA CACGCCGCGC CGCAAGCGTC GGGAAGCAGC
GCATCGTCCG TACTCGAGAT GGACGTCATT CCGCGAACCC CCGAACGCCT CGCCCGCGGC
CAGTATCTCG TCGAAGGACT GCTGCAATGC CCGGCTTGTC ACTCCGAGGT TAATTTCGGC
AAACGTCCGC CGGAACCCAT GCCCGGCGCA AAGCTCGGCG GACACATCTT CGCAAACGCT
GAACTCGGAT TGCCGGAGCC GAACCGCATC GTCGCGCCGA ACATCTCATC CGATCCCGAG
TATGGCGCCG GCACGTGGAA AGACGCAGAC TTTGTTCGCG CTCTCCGGCA GGGTATCGGC
CATGACGGCC GCACGCTCTT CCCGCTCATG CCCTACGAGT TCTTCCGCCA GCTTTCCGAT
GAAGACCTCG CCGCGGCGAT TGTTTACATC CGCTCGCTGC CGCCAGTTCA TCACGAGCAG
CCGAAGACCT TCGTCACCGA AGACCTGAAG AAAACCTACA AGCCGTTTCC GATGCCAGCG
TCCGTCGCTG AGCCGGACCG CTCCGATCGC GTGGCCTACG GCAAGTACTT GGCGACCGCC
GGACATTGCG GCGCCTGCCA CGACGGCTAC GACGACAAAG GCGCCCCCAT CCCCGGCATG
CAATTCTCCG GCGGAGCCCC GCTCACCGGC CCATGGGAAG GTGGCGAGAA GGTCATCAGC
GTGAACGCTG CCAACCTCAC GCCCGATCCC TCCGGCATTG GCTACTACAA CGAGGCGATG
TTTATCGAAG TCATCCGCAA CGGCGGATTC AAGGCGCGTC CGCTCTCCAA CATCATGCCG
TGGTCGTTCT TCCGCAACCT CACCGACGAC GATCTCAAAT CCCTTTTCGC GTACCTGCAG
TCGCTGAAGC CCGTTTGCCA TCACGTGGAT AACACCGAAG TCGCCACCTA CTGCAAGAAG
TGCAAAACCA AACACGGTTT GGGCGAGATG AACGAGGAGA CGCTCACGGC AAAATGA
 
Protein sequence
MPPHFPQSVG PYSCDAATTV AEVCNCRPAR FAILLDPFVA PNYYPAQLAP ANSPQFRTRA 
ILIAGAGPYV ARLRAFARAR RSAFFVNLTL GPKIFKFKQV DVKVRLSGSN LLWMSSAMLV
CLTFSNPIGT HAAPQASGSS ASSVLEMDVI PRTPERLARG QYLVEGLLQC PACHSEVNFG
KRPPEPMPGA KLGGHIFANA ELGLPEPNRI VAPNISSDPE YGAGTWKDAD FVRALRQGIG
HDGRTLFPLM PYEFFRQLSD EDLAAAIVYI RSLPPVHHEQ PKTFVTEDLK KTYKPFPMPA
SVAEPDRSDR VAYGKYLATA GHCGACHDGY DDKGAPIPGM QFSGGAPLTG PWEGGEKVIS
VNAANLTPDP SGIGYYNEAM FIEVIRNGGF KARPLSNIMP WSFFRNLTDD DLKSLFAYLQ
SLKPVCHHVD NTEVATYCKK CKTKHGLGEM NEETLTAK