Gene Ccur_01920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcur_01920 
Symbol 
ID8374400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCryptobacterium curtum DSM 15641 
KingdomBacteria 
Replicon accessionNC_013170 
Strand
Start bp234984 
End bp236333 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content49% 
IMG OID644993115 
Productarabinose efflux permease family protein 
Protein accessionYP_003150605 
Protein GI256826646 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones105 
Fosmid unclonability p-value0.535895 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGGCT CAACCGTTTC TGCCGGCAGC ATCGATGCGA TTTCTGAAGA AGAAAAGCGC 
CGCACGCTGA ACCGCGTAGC GTTTTCTTCC TTTCTCGGAA ATTTTATCGA GTGGTTTGAT
TATGCAAGTT ATTCCTATCT TGCAACGGTT CTTGCCGTTG TGTTCTTTCC TGGGGAAGAC
AAGGTAGTTG CCACGATGAG CACATTCGCC GTATTTGCGC TTGCATTTCT TGTTCGTCCT
GTTGGCGCCA TATTTTGGGG CAATATGGGA GATAAAAAAG GGCGTAAATG GGCACTGTCT
ATTTCTATCT TGTTGATGAG TGGTGCGACC TTTCTTATTG GTTGTTTGCC GGGTTATGCG
GTGCTGGGGG TAGGTGCTCC CATTTTGCTG CTGCTTTTGC GCATGGTGCA GAGCTTTTCT
GCATCGGGGG AATATGCAGG CGCTTCAACC TTTATTGCGG AATATGCTCC CAAACAGCGG
CGCGGTTTTT TCTGCTCATT CGTACCTGCT TCAACAGCAA CCGGTCTGCT GATCGGCTCC
CTTGCTGCAA CGCTCATGTT TAGCGTATGG GGAGCAAGTT CCGACTTTGT CGTGAATTGG
GGTTGGCGCA TTCCCTTCCT AGTTGCGCTA CCACTTGGCT ACATCACCCA TTACATCCGC
GTTCATCTGG AGGATTCACC TATCTATGCC CAGATGCAGG ACGCAATTGA TCAAAAGAGT
GCCGAAGCAG AAGATCATCC TATTCGCGCA CTGTTCACGA AGCATGGGCG GAAAACCATT
ATTTCGTTTG GCGCTTGCGT GCTTAATGCG GTGGGGTTCT ATGCCGTGTT GACCTATCTG
CCGAACTACC TTGAAGTAAC ACTCAACTAC GATTCTGCGG CAGCCTCTAC TATTACCACC
ATCGTGCTGG TGCTCTACAT TGCCTTTATT TTCACGGCTG GGCGTATATC CGACAAGGTA
GGACGCAAGC GGATGCTTAT AACTGCCTGC GTTGGCTTTA TCATCTTTAC CATTCCTGCG
TTCCATTTGC TTGCGAGCCA GGACTTCATT GTTATTTTGT TGGTCGAGCT GTCCATGTGC
ATGTTGCTTA CCATTAACGA TGGATCGCTT GCGAGTTATC TAACCGAGAC ATTTCCAACT
GAAGTGCGTT ATTCGGGATT TGCGTTTAGC TTCAATTTGG CAAATGCTAT TTTTGGCGGG
TCAGCTTCCT ACATCGGCTT TGCGCTTATC AACGCAACAG GCGATCCGGT TGCGCCGGCA
TACTATTTGG TTGCGATTGC TGCAATAGCG CTGGTGGCAA TGTTGTTGTC CCACGAGCAC
GCTGGTAAAG ATTTATCGCA TGTGAAATAA
 
Protein sequence
MDGSTVSAGS IDAISEEEKR RTLNRVAFSS FLGNFIEWFD YASYSYLATV LAVVFFPGED 
KVVATMSTFA VFALAFLVRP VGAIFWGNMG DKKGRKWALS ISILLMSGAT FLIGCLPGYA
VLGVGAPILL LLLRMVQSFS ASGEYAGAST FIAEYAPKQR RGFFCSFVPA STATGLLIGS
LAATLMFSVW GASSDFVVNW GWRIPFLVAL PLGYITHYIR VHLEDSPIYA QMQDAIDQKS
AEAEDHPIRA LFTKHGRKTI ISFGACVLNA VGFYAVLTYL PNYLEVTLNY DSAAASTITT
IVLVLYIAFI FTAGRISDKV GRKRMLITAC VGFIIFTIPA FHLLASQDFI VILLVELSMC
MLLTINDGSL ASYLTETFPT EVRYSGFAFS FNLANAIFGG SASYIGFALI NATGDPVAPA
YYLVAIAAIA LVAMLLSHEH AGKDLSHVK