Gene Dshi_4054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_4054 
Symbol 
ID5714583 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009957 
Strand
Start bp113345 
End bp114496 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content60% 
IMG OID641276966 
ProductTPR repeat-containing protein 
Protein accessionYP_001542262 
Protein GI159046592 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4235] Cytochrome c biogenesis factor 
TIGRFAM ID[TIGR03142] cytochrome c-type biogenesis protein CcmI 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.331246 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCTGGG GTATTTTCGT TCTTTTGACG CTGGTCGCAA TCGGAATTGT CCTCTACCCG 
CTGCTGCTGT CCAAATCGAA CACTCTCACC AGAGGGGATG CGGTACCGGC GATCCTCGCC
GATCAGATGC GGGAAATCCA ACGAGACATG GACCGCGGTT TGATTTCTGA ACAGGAAGCT
CAAGCGGCCA GACTCGAGAT CAAGAAACGG ATCCTCGCGA CGACCCGCAA TTCGGAAGAA
AAAGCCGGAT CATCCCGTAA TGGTGGCAGA GTCACTCTCG TTGTCGCGGC GGTCCTCGCG
CCGGTTATTG CCGCGGGCTA CTATCTGACG ATGGGATCGC CGGAGGTTCC TTCAATGGCC
TTCGCTGCCC GTGCAGAGGA GCGGGCACAG ACTGATGAAG TGACGGCGCT GGCAATACAA
CTTCGCGAAA GGCTTGTATC CGATCCGACC GGCGGCCCTA GCGAAGGGTG GATGCTGTTG
GGCCAGACCT ATCAGCGCAT GGGCAGAGCA GCCGATGCCG TCGAAGCCTT CGAAGTCGTT
GCCGAACGGG AGGATGCCAC CTCCGCGACA TTCTCCATGC TGGCAGAGGC TGTCGCGGTT
GCCAATGACG GCGTTGTTAT TCCACGGGCG AAATTGGCCA TTGATCGCGC TTTGGACCTT
GACCCATCCA ATCCTGCGGC AACCTACTTC GAGTCTCTGT ATTACTTTCA GCAAGAAGAG
ACGCGTAAGG CCTACGATCT GCTTGTTTCA CGACTGAATC AGGAAGAGGC TTACGTTCCC
TGGATGGAGA CCTATGCAGC GCAGATCAAC CGGATTGCAG CGGCAGCTGA CCTTCCGGTC
GTCAATCTAC CCAGCGGGCC GACGTCGCAA CCAGGCCCGA GTGCCGCAGA TGTCGCGGCG
GCGTCGGAGA TGAATGACGC GGATCGCGCG GAGTTCATCC GATCGATGGT CTCGCGGCTG
GCCGAGCGTC TTGAGGATGA ACCGGACGAT CTGGATGGTT GGATGCGGCT GGCAAATGCC
TATACCGTCC TTCAGGAGCC CGATCGCGCC GTTGAGGCCT ACCGCAAGGC TGAAGCGCTG
TTGGAGCAGC AGCCCGCCAG TGATCCTCGG CGTGCGGCGG TCCGTGCCGC GCTTGAGCGG
CTCGGCGGCT GA
 
Protein sequence
MIWGIFVLLT LVAIGIVLYP LLLSKSNTLT RGDAVPAILA DQMREIQRDM DRGLISEQEA 
QAARLEIKKR ILATTRNSEE KAGSSRNGGR VTLVVAAVLA PVIAAGYYLT MGSPEVPSMA
FAARAEERAQ TDEVTALAIQ LRERLVSDPT GGPSEGWMLL GQTYQRMGRA ADAVEAFEVV
AEREDATSAT FSMLAEAVAV ANDGVVIPRA KLAIDRALDL DPSNPAATYF ESLYYFQQEE
TRKAYDLLVS RLNQEEAYVP WMETYAAQIN RIAAAADLPV VNLPSGPTSQ PGPSAADVAA
ASEMNDADRA EFIRSMVSRL AERLEDEPDD LDGWMRLANA YTVLQEPDRA VEAYRKAEAL
LEQQPASDPR RAAVRAALER LGG