Gene Caci_2223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_2223 
Symbol 
ID8333572 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp2523091 
End bp2524428 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content68% 
IMG OID644955377 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003112983 
Protein GI256391419 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00013617 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.000156735 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCGGGTCA GGGTGCTGCG GGTGCTGGAT GTTGGGGGGT TGCCGCGGGT TTTTTGGTGG 
GTGTGGGTTA GTACGTTGGT GGCGCGGACG GGGGCTTTTG TTGCGCCGTT TCTTTCCTAC
TACCTGACGC GGTCGCTGGG GCATTCGGCG GCTTTTGCGG GGTTCGTTGC CGCGTTGAAT
GCGGGCGGGG CGGCTGTTTC GGCGGTGGTG GGGGGTGTGC TGGCTGATCG GGTGGGGCGG
CGGGGGACGT TGCTGGGGGC GCTGGTGGCG TCGGCGGTGA CGTTGGTGGC GCTTGGGTCG
GTGCACTCGG TGGGGTTGAT TGCGGTGTTG GCGTTCCTTG CGGGGTTGGC CAACAATGCG
ACGCGGCCGG CGACGGGGGC GATCATCGCG GACATCGTGC CGTCGGGGGA TCGGGGGCGG
GCGTATGCGC TGAACTACTG GGCGATCAAC TTGGGGTTCG CGGTCGCGAT GCTGTCTGCG
GGGGCGGTGG CGTCGCACGG GTATTCGCTG CTGTTCATGG GTGATGCGAT CGCGAATGTG
GGGTGCGCGG TCGTCGTCTT CTTCACGGTG CCGGAGACGC GGCCGACTTC TGTTGGTGTC
GCCGGCGCAG GGCATGCCGG TCAGCCCGAG CGTGCGGGCA CTCTCGTGGA TGTGCTGCGC
GACCGGATCT TCTTGGGGTT CCTGGGGGCG GTGCTGGTCG GGGCGGTCAT CTATTCGCAG
GCTCAGACCG TGCAGCCGAT CATGATGGGT CAGGACGGTC TTGGGCCTGG TGCATATGGC
GCGGTCGCGG CGCTCAATGG GATCCTCATC GGTGTTCTGC AGTTGCCGAT GACGTCGTGG
ATGCGGCGAT ACACCCATGG GTCGGTGCTG GCGGCCTCGT CGTTTCTGAT GGGCGCCGGG
TTCGCTGTGC CTTTGCTGAT CTCGGCGGTG GGGCACCCGA TGGGGGTCTA TGCCGGCTCG
GTGGTGGTGT GGACGATCGC GGAGATCGGG AGCACGCCGC CGCAGATGGC GCTGGGTGCG
GATCTGGCGC CGGCGCATCT GCGCGGGAGG TATCAGGGGA TGTCGACGCT GGCGTGGAGT
GTGGCCGGCA TCGTCGGTCC GTTGGTGGGC GGCTGGGCAC TGACGGCGAT CGGTGCTTCG
GCCGTGTTGT GGGCGAGCCT GCTGCTCGGC GCGGCAGGTG TGCCGGCGTG GGTGATGCTC
GACCGCCGAT CGAGAACACG AGTGGCCACG TTGCGCGCCG CCGAAGCGCA CTGGGAACCG
GTGTTGTCCG CCATCGCGTC ACCCGAAGTG GTGTCGGCCG GTGTGCCGAC GTCCGAGCCG
GAGCCGGAAC CGGTGTGA
 
Protein sequence
MRVRVLRVLD VGGLPRVFWW VWVSTLVART GAFVAPFLSY YLTRSLGHSA AFAGFVAALN 
AGGAAVSAVV GGVLADRVGR RGTLLGALVA SAVTLVALGS VHSVGLIAVL AFLAGLANNA
TRPATGAIIA DIVPSGDRGR AYALNYWAIN LGFAVAMLSA GAVASHGYSL LFMGDAIANV
GCAVVVFFTV PETRPTSVGV AGAGHAGQPE RAGTLVDVLR DRIFLGFLGA VLVGAVIYSQ
AQTVQPIMMG QDGLGPGAYG AVAALNGILI GVLQLPMTSW MRRYTHGSVL AASSFLMGAG
FAVPLLISAV GHPMGVYAGS VVVWTIAEIG STPPQMALGA DLAPAHLRGR YQGMSTLAWS
VAGIVGPLVG GWALTAIGAS AVLWASLLLG AAGVPAWVML DRRSRTRVAT LRAAEAHWEP
VLSAIASPEV VSAGVPTSEP EPEPV