Gene Ccur_11020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcur_11020 
Symbol 
ID8375309 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCryptobacterium curtum DSM 15641 
KingdomBacteria 
Replicon accessionNC_013170 
Strand
Start bp1253669 
End bp1256545 
Gene Length2877 bp 
Protein Length958 aa 
Translation table11 
GC content51% 
IMG OID644994024 
ProductYhgE/Pip-like protein 
Protein accessionYP_003151475 
Protein GI256827516 
COG category[S] Function unknown 
COG ID[COG1511] Predicted membrane protein 
TIGRFAM ID[TIGR03061] YhgE/Pip N-terminal domain
[TIGR03062] YhgE/Pip C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.458925 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones132 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGTACGG TGCTGCGCAT TTTTCAGCGT GACATAGTGC GTCTAATAAA AAATCCGTTT 
GCCCTGGTGG TTATTGTGGC AATGTCGGTT CTGCCGGCAT TGTATGCATG GTATTGCATT
GAAGCAAACT GGGACCCTTA CGAACATACC AATGGACTGC ATATTGCCGT CGCTAACGAA
GACGTTCCGA CAACGGTCGA AGGGATGGGA TCGGTCGACG TCGGCGGTCA AATAGTTGAT
CAGTTATCAC AAAACAGTCA GTTTAGCTGG GAATTTGTTT CAAAAGGTGA AGCGATAGAC
GGAGTTAAGT CAGGACGCTA TTACGCTGCC TTTGTGATCC CCTCGTCGCT TTCTGCTGAC
TTTACCAGCG CACTGTCGGG CGACCGACAT GAGCCACATA TCGATTACTA CGTCAATGAA
AAGTACAGTT CAACCGCTGT AAAAGTAACC GATGCTGGAT CAACGGCAAT CGAGCGTCAG
GTAGAGGAAG CCTTTTCAAA AGCGGTATCC GAAGCGCTTA TCGGTTTGTT GAAAGACCGA
GCTGGCGACA TTGAAAGCCA GATTGATACA GCTGATGGGC GTCTTTCTCA AACGGTCACT
GGGGCAAATG AACAGGTGGC TCAAATCATC GATACGCTAC AGGACGTTTC GCAGTCTATC
GACTCGTGGT CGCATGTCGT TGACAATGCT CAAGAAACGC TCGATGTGCT TGAAACTGCC
GTGCCGCAGG CATCAACGTC TATCGATGAT GCAGCGCGCG TTCTTTCTCA GGTACGCACT
TCATCGGATG GCTTCAGTTC ATCGCTCGCA GGGACCCTTA CCCAGAGTGG TGCGCTTATA
GGGAGTACTT CTTCGCAGGT AAATAGCCAG GTTAACGATG CTGTTGGACG CATTATGGTG
GCGAAGGCTG ACGTGGATAG TGTGCTTACC CGTCTTCAAG GGGTTATCGC GCAGAACGAC
CAGATTATTA GTCAGTTGCA AGATGCCGCT GCGGCACTGC CGGCGGGCAG CGCGAAGACT
ACCGCGCTCA ATACGATTGC GCGTTTGCAA TCCCATAACG ATACGCTCAG GCAGCATGTC
AATAACCTCT CAAATGTCAG TACGAGCCTC GGTCGTGCTA CTCTTGCTGC GCAGAATCTT
TCGAATGCTA TGAACACGAC AGTGCAGACG GGTGCTCAAT CCATCATGGT CACGGGAACA
GCTCTTGCTG CAAGTGCGTC GCGTATCGGC GGTTCGCTTG ATTCGCTCGT GCTGTCGATG
GGCGGTCTCT CTGGGGCAGT TTCTGGACTA GGCCCGCAGG TGGAAGAAAT TTCTTCGTTG
CTTTCACAGG CGCAGGGTGT CTTTAGCCAG GTTCAAGCGT CTCTTTCTAA CACGGAAAGT
TCTCTTGGTG CGGTACAAAA GCACGTTGAC GACACTGTTG ATGATATCCG CGCCATCGCT
TCAGCGCTCG AAGCCGATCA ACTTTCTGTG GTGCTTGGTC TTGACGTAAC AAGCATTGGC
GAGTTCATGT CGTCGCCGGT TACCCTTGAT ACCGAAGTGC TCTATCCGGT GGAGCATTAT
GGTGCAGCCG TTGCCCCGTT CTATACCAAT CTGGCTATTT GGGTCGGCTG CTTTATGTTG
ATAGCTATTC TTAAGATGGA AGTCGATCGC AAAGGGTTTG AATCGATGAC AGCTGTGCAG
GCTTATATGG CTCGCTGGCT GCTGTTCGTT TTAGTTTCCT TAGTTCAGGC GGCGATTATC
TGTATCGGTG ATGTGGCATT GGGCATTGGC TGTGTCGACC CGATATCGTT TGTTATTGCG
GGGCTCGTTA CAAGTTTTGC CTTTGTGAAC GTGGTTTTCA TGTTAGGGAT TACCCTGAAA
CACATTGGCA AGGCGCTTGC TGTGCTGTTG CTTATCATGC AGATTCCTGG TTCGTCGGGT
ATGTACCCTA TCGAAATGAT GCCGTCCTTT TTCCAACAGG TACATCCACT CTTACCCTTC
ACCTATGGTA TTTCTGCCAT GCGCGAGGCA ATTGGGGGCA TGTACGGCGG TGCATATGCC
GCCAACATAG TGGTGCTTTT GCTTATCGCG GCGGTTTCGC TACTCATCGG CTTATTTGTC
CGTCCGTATG TTCTTAATTT GAACGTGTTG TTTGACAAAC GATTACGCGA AACGACCTAT
ATGGTCAATG ACGATCAAGG CCTGCGCGAA CCGCGTTATC GTGTGCGCAA CGTGGTGCGT
GCTCTTTTGG ACAACGACGA ATACCGAAGC GTTTTGCTTG GTCGTGCGAC GCGCTTCAAC
CGTCGCTATC CATCTATTGC ACAGATCAGT CGTATTTCAA TAATCGTCCT TCCACTTGTT
GCTCTGGTTA TCATGTGCCA ATTTTCGCTG GGGCCGAACG GCAAGGTGCT CATGATTGTG
GGCTTTGTCA CGCTGGTTGT TTTGCTGGGA TGGGCGCTGG TCCTTATCGA GTACTTCCAT
GATTACTTGA ATAACCAGGT GCGCATGGTT GCGCACAGTG GAGAAGATTT GATGGCTGAT
GTGATGGGTC ACACTCCACT GCGCCACAAG GATTTTTCTG CCAACCACGC CGAAAATGTG
CGTGCGGTAC TTTCCCGTTT GCGTGAAGTA CCTCAGAAGA ATGCCGCTAC AGAAGAGCTC
TCCTCGGGCG AGGTCTTGAA TGATGTTTCG AGTGATATTC CAAGTAAAGC GTTAAATGGT
GCTTCAGATG CAGTATTAGA TGGTGCTGCG AGTGACGCTT CGGATGAAAC TTTCCATAAG
GTGCCGAACG GCATTTCAAG CGATACTGCA GAGACTCCTT CGGACAGCAC TTCAGATGGT
GCTTCGAAAA CGTCAGACGA TGCTTCGGAC GCGTCTATTG ATAAGGAAAA GCGGTGA
 
Protein sequence
MGTVLRIFQR DIVRLIKNPF ALVVIVAMSV LPALYAWYCI EANWDPYEHT NGLHIAVANE 
DVPTTVEGMG SVDVGGQIVD QLSQNSQFSW EFVSKGEAID GVKSGRYYAA FVIPSSLSAD
FTSALSGDRH EPHIDYYVNE KYSSTAVKVT DAGSTAIERQ VEEAFSKAVS EALIGLLKDR
AGDIESQIDT ADGRLSQTVT GANEQVAQII DTLQDVSQSI DSWSHVVDNA QETLDVLETA
VPQASTSIDD AARVLSQVRT SSDGFSSSLA GTLTQSGALI GSTSSQVNSQ VNDAVGRIMV
AKADVDSVLT RLQGVIAQND QIISQLQDAA AALPAGSAKT TALNTIARLQ SHNDTLRQHV
NNLSNVSTSL GRATLAAQNL SNAMNTTVQT GAQSIMVTGT ALAASASRIG GSLDSLVLSM
GGLSGAVSGL GPQVEEISSL LSQAQGVFSQ VQASLSNTES SLGAVQKHVD DTVDDIRAIA
SALEADQLSV VLGLDVTSIG EFMSSPVTLD TEVLYPVEHY GAAVAPFYTN LAIWVGCFML
IAILKMEVDR KGFESMTAVQ AYMARWLLFV LVSLVQAAII CIGDVALGIG CVDPISFVIA
GLVTSFAFVN VVFMLGITLK HIGKALAVLL LIMQIPGSSG MYPIEMMPSF FQQVHPLLPF
TYGISAMREA IGGMYGGAYA ANIVVLLLIA AVSLLIGLFV RPYVLNLNVL FDKRLRETTY
MVNDDQGLRE PRYRVRNVVR ALLDNDEYRS VLLGRATRFN RRYPSIAQIS RISIIVLPLV
ALVIMCQFSL GPNGKVLMIV GFVTLVVLLG WALVLIEYFH DYLNNQVRMV AHSGEDLMAD
VMGHTPLRHK DFSANHAENV RAVLSRLREV PQKNAATEEL SSGEVLNDVS SDIPSKALNG
ASDAVLDGAA SDASDETFHK VPNGISSDTA ETPSDSTSDG ASKTSDDASD ASIDKEKR