Gene Ccur_04410 details


Gene Information

Locus tag: Ccur_04410
Symbol:
ID: 8374649
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cryptobacterium curtum DSM 15641
Kingdom: Bacteria
Replicon accession: NC_013170
Strand:
Start bp: 516722
End bp: 518140
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 52%
IMG OID: 644993365
Product: arabinose efflux permease family protein
Protein accession: YP_003150846
Protein GI: 256826887
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
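
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
# Values copied from the record above.
start_bp = 516722
end_bp = 518140
reported_gene_length = 1419
reported_protein_length = 472


def gene_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bp):
    """Residues encoded by a CDS of n_bp bases, excluding one stop codon."""
    codons, remainder = divmod(n_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)  # 1419 472
```

Under these assumptions both computed values agree with the reported gene length and protein length.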


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 108
Fosmid unclonability p-value: 0.684912
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCACAGG TAAACACCCC CATTCAATCA GGAAATGCAC CACGCATCGT AGTTCTTATG 
ACCTCGCTGC TTGTTGCTAT CTTTGCTTTC CAACTTAATG CATCGATGCT TTCGCCAGCG
CTCGTCACTA TGCAAAACGA ACTTGCAACC ACCGAAGCAG CCATCGGTCT GACACAGACC
GTATTCTTTA CGGCAGCTGC CCTGTTTTCG CTGTTTCTTC CCCGCTTAGC TGACCTTATT
GGACGGCGCA AGGTACTGGG CAGCATGCTT GCACTCACCG GCGTTGGCTG CCTTATTTCG
GCCATAGCTC CCGACGTCAA TATTCTGATG ATTGGACGTG TACTGCAGGG TGTATCAGGG
CCTGTTGTAC CAATGTGCTT AATTATGCTG CACGAAGAAG TAACTGATAC CGCGCGCTAT
ACTCGCCTTA TGGCAATTCT GACTTCCGTT AATGGTGGCA TCGCTGGCGT TGACGCAATT
CTCGGCGGTT GGCTCGCGGG AAACTTTGGC TTCCGTTCGG TCTTTTTAGC CATGGCGCTC
GTCGCGATTG TAGCTGTTGT ACTGGTGCTT GCCTTTACCC GCGAAAGTAC CGCAGATGAT
ACTCCCAAGA TGGACTGGGC TGGCGTCTTT TCTCTTGGTA TTGCCTTTTT GGCAACCTAC
CTTGCCATCA ATGAAATTCA AAAGCTTGCA GCCATGAGTG TTCCGCTGGT TGTGGGCTTT
ATCGTAGTGG CTGTGATCGC GTTTATTGCG TTCTGGAATC TTGAGAAACG CAACCCAGCT
CCCATGGTAT CGACCATCTA CATGAAGCAA CGTCGTACCT GGGGTTTACT CTTGACGACC
TTACTTACCA TGACCGGTGT GTTCGCTGTG ATGAACGGTG TTGTCCCGGC AATCGCACAA
GACACCACCT TATGGAGCGG CATGGGCGCT GATGTCGTGT CGTTTGCCAC CTTGACCCCC
TATGCGCTGC TTGGCCTGGT GTTTGGACCG ATTACTGGCA TGCTCGCGAG CAAGTTCGGC
TACCATGCAG TGCTGCGCTG CGGCTTGGTG GTAAGCATCG CGGGCATTCT GTTTGGTGTC
TTTATTGCTC AGAGCCCAAG CATTCCCCTA TTGGTAGCCA TTTCAGTTGT TCTTGGTATC
AGCTATGCCG GTACCGCAAA TATCATGCTC AACGGCTTGG GAATTATGCT TTCACCGAAG
GACAATCCCG GTTATTTACC TGGTATGAAC GCTGGCGCAT TTAACTTAGG TGCAGGACTG
AGCTTTGTTG TTCTGTATGC CGTAATGGGT GCCGTTAATA TCGGGGGCGA TGTGGCAGCC
GGCTATGCTT CGTCCCTTAT CACCGGTGCC GTTATTTTAG TGTTTGCCCT GCTTGCTTCG
TTTTTGATCC CGCGTCCAGA AGACGCCGAT CGCAGCTAG
 
Protein sequence
MSQVNTPIQS GNAPRIVVLM TSLLVAIFAF QLNASMLSPA LVTMQNELAT TEAAIGLTQT 
VFFTAAALFS LFLPRLADLI GRRKVLGSML ALTGVGCLIS AIAPDVNILM IGRVLQGVSG
PVVPMCLIML HEEVTDTARY TRLMAILTSV NGGIAGVDAI LGGWLAGNFG FRSVFLAMAL
VAIVAVVLVL AFTRESTADD TPKMDWAGVF SLGIAFLATY LAINEIQKLA AMSVPLVVGF
IVVAVIAFIA FWNLEKRNPA PMVSTIYMKQ RRTWGLLLTT LLTMTGVFAV MNGVVPAIAQ
DTTLWSGMGA DVVSFATLTP YALLGLVFGP ITGMLASKFG YHAVLRCGLV VSIAGILFGV
FIAQSPSIPL LVAISVVLGI SYAGTANIML NGLGIMLSPK DNPGYLPGMN AGAFNLGAGL
SFVVLYAVMG AVNIGGDVAA GYASSLITGA VILVFALLAS FLIPRPEDAD RS
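
The sequences above are printed in space-separated groups of 10 characters across several lines, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup plus a GC percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first row of the gene sequence, not the full gene.

```python
def clean_sequence(block):
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence from this record (illustration only).
first_row = "ATGTCACAGG TAAACACCCC CATTCAATCA GGAAATGCAC CACGCATCGT AGTTCTTATG"
cleaned = clean_sequence(first_row)
print(len(cleaned))
print(round(gc_percent(cleaned), 1))
```

Applying `clean_sequence` to the complete gene block and passing the result to `gc_percent` gives a value that can be compared with the GC content field reported in the record.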