Gene Ccur_03920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcur_03920 
Symbol 
ID8374600 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCryptobacterium curtum DSM 15641 
KingdomBacteria 
Replicon accessionNC_013170 
Strand
Start bp458921 
End bp460000 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content53% 
IMG OID644993316 
Productexopolyphosphatase 
Protein accessionYP_003150798 
Protein GI256826839 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones101 
Fosmid unclonability p-value0.20783 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACAGC GCATTGCAGC AATCGATATC GGAACGGTTA CGTGTCGACT GCTTATTGCC 
GATGTTGCTG ACGCACGTAT CAACGAAGTA GCTCGTGAAT GCACCATTGT GAATCTTGGA
GAAGGAGTCG ATGCTACAGG AGTGCTTTCA CCAGCAGCGA TCGATCGAAC GGTGGACTGT
CTTGCTTCGT TTATGCACAC CATTGATGCT TACCGGCAGG ACGCTCCCAT AACGGTACGC
TGTATGGCGA CATCAGCGAG TCGCGATGCC CGCAATGCCA GTGAATTGAC GGCACGATTA
GCGCAGCTTG GCTTAACCTT GACGGTGATA TCCGGTGAAA AAGAAGCTGA GTTGTCGTTT
CAGGGTGCAA GTGCGGCCTT TCCGGGTGAA GAAGTGGCTG TTGTCGATGT TGGTGGTGGA
TCGACCGAGA TTATCGTTGG ACAGGGCGGC GCCGCACCCC GAAGGTCCCA TTCGTTTAAC
ATAGGTGCAC GACGGGCGAC AGAACGCTTC ATACAAACTG ATCCGCCCAG TGCTGATGAT
ATGAAGGCGA TACACGACTG GTGTCGTCCG GTCTTCGAGA GCTTCTTCGC TGCTTCAGGT
GCTTCAGGTA CGTGTCATTC GACTGGTAAT GCGTCCTGTA GTTCGTCCGA CGGTACTGGT
ATGTCCAATA AGGCTGGTAC GTCCGACGGG GTAAGTGCGC CTAGCAATGC CAGTATGTCT
AATAGTGTTG GCGCTCCTGA CAATGTCATC GTCCCGCCAC AGCGACTTAT TGCTGTTGCG
GGTACCGCGA CAACAGCGGT GTCAGTTGCA GATGCGATGG AAGCTTATGA TTCGTCGCGT
GTGCACGGGC GCGAAATGAA AGCAGCAGAG CTTGATAACC TCATTGATCG CCTTGCGGCA
TTGAATCTCG CTGAACGCGA AGAGGTTGTT GGCTTACAGC CACAACGCGC ACCTATTATT
GTGGCAGGTC TGCTTATACT TTCTGATGCC GTACACGCGG CAGGTACTGG CAGCTTCATT
GCAAGTGAAT CAGATATTCT TGCAGGCATC ATTATGGATA CCGCTTGCCG TTCTTCTTAA
 
Protein sequence
MEQRIAAIDI GTVTCRLLIA DVADARINEV ARECTIVNLG EGVDATGVLS PAAIDRTVDC 
LASFMHTIDA YRQDAPITVR CMATSASRDA RNASELTARL AQLGLTLTVI SGEKEAELSF
QGASAAFPGE EVAVVDVGGG STEIIVGQGG AAPRRSHSFN IGARRATERF IQTDPPSADD
MKAIHDWCRP VFESFFAASG ASGTCHSTGN ASCSSSDGTG MSNKAGTSDG VSAPSNASMS
NSVGAPDNVI VPPQRLIAVA GTATTAVSVA DAMEAYDSSR VHGREMKAAE LDNLIDRLAA
LNLAEREEVV GLQPQRAPII VAGLLILSDA VHAAGTGSFI ASESDILAGI IMDTACRSS