Gene Dshi_3778 details

Gene Information

Locus tag: Dshi_3778
Symbol:
ID: 5714307
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009955
Strand:
Start bp: 176766
End bp: 177917
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 60%
IMG OID: 641276693
Product: TPR repeat-containing protein
Protein accession: YP_001541989
Protein GI: 159046317
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4235] Cytochrome c biogenesis factor
TIGRFAM ID: [TIGR03142] cytochrome c-type biogenesis protein CcmI
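
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted as a residue; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 176766
end_bp = 177917
reported_gene_length = 1152      # bp
reported_protein_length = 383    # aa

# Assumption: coordinates are 1-based and inclusive,
# so the span length is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print("gene length matches:", gene_length == reported_gene_length)
print("protein length matches:", protein_length == reported_protein_length)
```

Under these assumptions both reported lengths are consistent with the coordinates.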


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCTGGG GTATTTTCGT TCTTTTGACG CTGGTCGCAA TCGGAATTGT CCTCTACCCG 
CTGCTGCTGT CCAAATCGAA CACTCTCACC AGAGGGGATG CGGTACCGGC GATCCTCGCC
GATCAGATGC GGGAAATCCA ACGAGACATG GACCGCGGTT TGATTTCTGA ACAGGAAGCT
CAAGCGGCCA GACTCGAGAT CAAGAAACGG ATCCTCGCGA CGACCCGCAA TTCGGAAGAA
AAAGCCGGAT CATCCCGTAA TGGTGGCAGA GTCACTCTCG TTGTCGCGGC GGTCCTCGCG
CCGGTTATTG CCGCGGGCTA CTATCTGACG ATGGGATCGC CGGAGGTTCC TTCAATGGCC
TTCGCTGCCC GTGCAGAGGA GCGGGCACAG ACTGATGAAG TGACGGCGCT GGCAATACAA
CTTCGCGAAA GGCTTGTATC CGATCCGACC GGCGGCCCTA GCGAAGGGTG GATGCTGTTG
GGCCAGACCT ATCAGCGCAT GGGCAGAGCA GCCGATGCCG TCGAAGCCTT CGAAGTCGTT
GCCGAACGGG AGGATGCCAC CTCCGCGACA TTCTCCATGC TGGCAGAGGC TGTCGCGGTT
GCCAATGACG GCGTTGTTAT TCCACGGGCG AAATTGGCCA TTGATCGCGC TTTGGACCTT
GACCCATCCA ATCCTGCGGC AACCTACTTC GAGTCTCTGT ATTACTTTCA GCAAGAAGAG
ACGCGTAAGG CCTACGATCT GCTTGTTTCA CGACTGAATC AGGAAGAGGC TTACGTTCCC
TGGATGGAGA CCTATGCAGC GCAGATCAAC CGGATTGCAG CGGCAGCTGA CCTTCCGGTC
GTCAATCTAC CCAGCGGGCC GACGTCGCAA CCAGGCCCGA GTGCCGCAGA TGTCGCGGCG
GCGTCGGAGA TGAATGACGC GGATCGCGCG GAGTTCATCC GATCGATGGT CTCGCGGCTG
GCCGAGCGTC TTGAGGATGA ACCGGACGAT CTGGATGGTT GGATGCGGCT GGCAAATGCC
TATACCGTCC TTCAGGAGCC CGATCGCGCC GTTGAGGCCT ACCGCAAGGC TGAAGCGCTG
TTGGAGCAGC AGCCCGCCAG TGATCCTCGG CGTGCGGCGG TCCGTGCCGC GCTTGAGCGG
CTCGGCGGCT GA
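
The record reports a GC content of 60% for this gene. As a rough sketch of how such a figure could be recomputed from the sequence as displayed (in space-separated blocks of ten bases), the hypothetical helper `gc_content` below strips whitespace and returns the fraction of G and C bases. It is an illustration only, not the method used by the source database.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    # Remove the spaces and line breaks used for display.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Usage on the first two ten-base blocks of the gene sequence.
# Passing the full 1152 bp sequence gives the whole-gene value instead.
first_blocks = "ATGATCTGGG GTATTTTCGT"
print(f"GC fraction: {gc_content(first_blocks):.2f}")
```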
 
Protein sequence
MIWGIFVLLT LVAIGIVLYP LLLSKSNTLT RGDAVPAILA DQMREIQRDM DRGLISEQEA 
QAARLEIKKR ILATTRNSEE KAGSSRNGGR VTLVVAAVLA PVIAAGYYLT MGSPEVPSMA
FAARAEERAQ TDEVTALAIQ LRERLVSDPT GGPSEGWMLL GQTYQRMGRA ADAVEAFEVV
AEREDATSAT FSMLAEAVAV ANDGVVIPRA KLAIDRALDL DPSNPAATYF ESLYYFQQEE
TRKAYDLLVS RLNQEEAYVP WMETYAAQIN RIAAAADLPV VNLPSGPTSQ PGPSAADVAA
ASEMNDADRA EFIRSMVSRL AERLEDEPDD LDGWMRLANA YTVLQEPDRA VEAYRKAEAL
LEQQPASDPR RAAVRAALER LGG
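
The protein sequence can be checked against the gene sequence by translating codons in frame. The sketch below builds the codon table from the standard ordering of bases `TCAG`; to my knowledge, translation table 11 assigns the same amino acids to codons as the standard code and differs mainly in the permitted start codons, so the standard assignments are used here as an assumption. The helper `translate` is hypothetical and only the first line of each sequence is compared.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed to apply to table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    residues = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence and first two blocks of the protein sequence.
gene_line = "ATGATCTGGG GTATTTTCGT TCTTTTGACG CTGGTCGCAA TCGGAATTGT CCTCTACCCG"
protein_start = "MIWGIFVLLT LVAIGIVLYP"

print(translate(gene_line) == protein_start.replace(" ", ""))
```

The same function can be applied to the full gene sequence to compare against all 383 residues.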