Gene CPS_5026 details

Gene Information

Locus tag: CPS_5026
Symbol:
ID: 3520868
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Colwellia psychrerythraea 34H
Kingdom: Bacteria
Replicon accession: NC_003910
Strand:
Start bp: 5341158
End bp: 5342744
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 36%
IMG OID: 637287465
Product: chain length determinant family protein
Protein accession: YP_271665
Protein GI: 71280402
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily
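
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(5341158, 5342744)
print(length_bp, protein_length(length_bp))  # prints: 1587 528
```

Both values agree with the Gene Length and Protein Length fields of the record.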


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAGATA TATTTGAAGA GATAATTGAT TATCTAAAAG GAATTTGGTT AAAACGTCGT 
TATATCATCA TAGCAACTTG GCTTATTTGC CCTATTGGCT GGTATTTCGT TGCTGCTATG
CCCAATGTTT ATCAATCAGA AGCAAGAGTA TATGTTGATA CCCAGTCTTT ATTGCGGCCT
TTATTAAAGG GTTTAACCGT CGAAACCAAT CCTGATACGC AAATTCGTCT TATGGTAAAA
ACACTATTAA GCCGTCCGAA TTTAGAGCGT ATCTCGCGTA TGACAGATTT AGATGTACAG
GCTAGCACTC CTGCTCAATA TGAAGCAATA ATCAAAAACC TAAAAGATAA CATTAAAATA
TCAAGCGCTC GCCGAGAAAA CATTTATACC TTAAGCATTG AAGATGAAGA TCCTGAAATG
GCAAAAAACA TAGTTCGCTC AGCGTTGACT GTTTTCATTG AAAATACTTT AGGCGAAACT
CGAAGTGACT CAGATAGTGC GCAAAAGTTT TTAAATACTC AAATCAAAGA TTATGAAAAC
CGCCTGTCAA ATTCAGAAGC CCGACTGACG AGTTTCAAAC AAAAGTATAG TGGTATTTTA
CCTGATCAAT CAGGTGGCTA TTATGCAAAG CTCAATGGTA ATAGAGAAAA ATTAAAAGCA
ATTGAGTTGG ATCTTTTGGA GAATAAAACC CGTCTTGATT CCGCTAAAAA GCAGCTAGCC
CAATCCGTAG TAGCTGACAC TGGAAGTGAT AACAAAATTA AAAGTGAAAA TTCAATACAA
ACTACTTATG ATGACCGTAT AAATGAATTA GAAGTTCTAC TGGATAATTT AAAGCTTCGT
TATACCGAGA AACACCCAGA TGTGATCGAA ACTAGCCGAA ATTTAGAACA TTTAAATAAA
TTGCGTAGCT CTGAAATTGA AAAGTATCTA GCACAAAATA ATGAGAACAC TGGCAGTTTA
AAACGCTTAA GTGCTAATCC CGTTATTCAA GAAGTACAAA TTCATGTTAA TCAACTGGAG
AATTTGATTG CTTCACTCAA TGTTAGAGCT GACAATTACC GCCAACGCAT TATCGAACTT
GAAAATAGAG TCCATACTTT ACCGGAAATT GAGGCTGAGT TAATCGCACT AAACCGTGGA
TATGGTATTA CCAAAGAAAA GTACGAAGAG CTGTTATCAC GTAAAGAAAC TGCACAACTG
GCTCAACAAG CCGATGAAAC TACTGATAAA ATTCAATTTC GTGTTATTGA CCCACCACGT
AAAGCAACTA AACCCTCAGG GCCAAAGCGC TTTCTTTTAT TTGGTGTCGT CACTCTTATA
GGGTTTGCTG TTGGCATTGG TTTGTCATTA TTAATGAGTC AATTGAACCC TGTAGCAACC
TCTACCGCTC AATTATCTAA AATTACCGGC GTACCTATTT TTGGTGTTAT CTCTGCCAAT
GAAAACTTAG GTTTACAACA ATGGCATAAA AAGAAAACTT TCATTTTTAT AGGCTCTAAC
GTGCTATTAC TCTGCTTGTT AGTGTTATTT GTTAGTTACT TTTTATTTCC TGATACTATT
CAAGCACCAT TAAAAAGGAT ATTTTAG
 
Protein sequence
MQDIFEEIID YLKGIWLKRR YIIIATWLIC PIGWYFVAAM PNVYQSEARV YVDTQSLLRP 
LLKGLTVETN PDTQIRLMVK TLLSRPNLER ISRMTDLDVQ ASTPAQYEAI IKNLKDNIKI
SSARRENIYT LSIEDEDPEM AKNIVRSALT VFIENTLGET RSDSDSAQKF LNTQIKDYEN
RLSNSEARLT SFKQKYSGIL PDQSGGYYAK LNGNREKLKA IELDLLENKT RLDSAKKQLA
QSVVADTGSD NKIKSENSIQ TTYDDRINEL EVLLDNLKLR YTEKHPDVIE TSRNLEHLNK
LRSSEIEKYL AQNNENTGSL KRLSANPVIQ EVQIHVNQLE NLIASLNVRA DNYRQRIIEL
ENRVHTLPEI EAELIALNRG YGITKEKYEE LLSRKETAQL AQQADETTDK IQFRVIDPPR
KATKPSGPKR FLLFGVVTLI GFAVGIGLSL LMSQLNPVAT STAQLSKITG VPIFGVISAN
ENLGLQQWHK KKTFIFIGSN VLLLCLLVLF VSYFLFPDTI QAPLKRIF
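
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the tool used to produce this record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, first base slowest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence in this record.
print(translate("ATGCAAGATA TATTTGAAGA GATAATTGAT"))  # prints: MQDIFEEIID
print(translate("CAAGCACCAT TAAAAAGGAT ATTTTAG"))     # prints: QAPLKRIF
```

The two outputs match the start and end of the protein sequence listed in the record. The final line can be translated on its own because every preceding line holds 60 bases, so it begins on a codon boundary.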