Gene Ccur_13540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcur_13540 
Symbol 
ID8375559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCryptobacterium curtum DSM 15641 
KingdomBacteria 
Replicon accessionNC_013170 
Strand
Start bp1530089 
End bp1531480 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content52% 
IMG OID644994270 
Productarabinose efflux permease family protein 
Protein accessionYP_003151711 
Protein GI256827752 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones122 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAAAG AAAAGCACGG CTTCGGTCGT CTTGCTGTCA TATACCTGAT TGGTCTTCTT 
ATCAGCGGAC TCTATGTTGG GCTTATCGCT CCGCTGCGGC TGGTTATTCA GGCAGACTTT
GGCATCGATG ATGCCCGGGG CATTTGGATG GTCAGTGTCT ACACGCTTTT TTATGCAGCG
CTTATTCCTG TTTCAGGTAA CCTAGCCGAC CGATATGGTC GCAAAGTGGT GTATCTGTTT
TGTTTACTAG TCTTTGGAAC CGGTTCGCTC ATCTGCGGTT TTTCGCAACA GCTTGGAAGT
TACGAGACCT TACTTGTAGG GCGTATCGTG CAAGCTGCCG GAGCAGGCGG CATTATTCCA
GTAGCTACCG CTGAAGTAGG AATGGCTGCT CCGCAGGGTA AGCGCGGTAT GTGGCTTGGC
ATCGCGTCGT CGGTGGCAGG CGTTTCAAAT GTTGTCGGTG CCGCCGCTGG TAGTGGCATT
GTTGGTCTGG TTGGTGTCGA TCAGTGGGGA TGGGCCTTTT TCTGTTCAGC CCCTATTTCC
CTTGTATTGG CGCTTGCCGC TCTCATATGC TTGCCAAAGG GAACTCCGCA GCGGCGGGGG
CGCCTCGACA TTGCAGGATC GGCCCTATTC ACGATCGTGT TGCTTTCAAT GCTTGTCGGA
TTACGCGAGC TCGATTTTTT CCATATCAAT ACGATTGTTT CTATCGATGT TTGGGGGCCA
CTGCTGGTGG CAGTCGTGCT TGCTGGTGCT TTTCGCGAGG TGGAGCGACG GGCGAGCGAC
CCAATTTTTC ATATAGAGTA TCTTGCTGAC CGCCACATCG TTTTAATTCT TGCGATTGCC
TTTTTTGTGG GCTGTAGCAT TATCAGTATG GTGCTGGTGC CGCAGTTTGC TGAAGCGCTG
CTCGACTTGC CCGTTGGATC GGGCGGTTAC CATATGGCTG TTTTGGGAGT GGCTGCCTTT
GTTGGCCCGC CGCTCAGTGG CAAGCTTATC GATACTCATG GTCCCAAGAT GCCGCTTGCG
TTCGGTTTGG CCATTACAAC GATAGGTTTT GTATTCCTTG CTGTAGTTGC TATAGCGGTT
CCCTCGCTTC CGATTGTGCT TATCGGGTTA GCAATAGTCG GTTTCGGTAT GGGATTTAGC
ATGGGAACCC CACTCAATTA TATGATGTTA CAAAATACCG AGCCCGAGCA GAGTACGTCG
GCCATCGCTA CCCTTGCGTT GGTACGACAG GTGGGTACCA CCATCGCTCC CGCTATCCTG
GTTGGTTTCG TATCGGCATC TGCTGGTATG GCGGGGTATC GCAATATGCT GCTCGCGGTG
GCGCTTTTCA ATGCAGCGGC TTTTATCATG CTGCTGTTCT ATCGATCTTC TCGTTCGAAG
GCGCCTGTAT AG
 
Protein sequence
MDKEKHGFGR LAVIYLIGLL ISGLYVGLIA PLRLVIQADF GIDDARGIWM VSVYTLFYAA 
LIPVSGNLAD RYGRKVVYLF CLLVFGTGSL ICGFSQQLGS YETLLVGRIV QAAGAGGIIP
VATAEVGMAA PQGKRGMWLG IASSVAGVSN VVGAAAGSGI VGLVGVDQWG WAFFCSAPIS
LVLALAALIC LPKGTPQRRG RLDIAGSALF TIVLLSMLVG LRELDFFHIN TIVSIDVWGP
LLVAVVLAGA FREVERRASD PIFHIEYLAD RHIVLILAIA FFVGCSIISM VLVPQFAEAL
LDLPVGSGGY HMAVLGVAAF VGPPLSGKLI DTHGPKMPLA FGLAITTIGF VFLAVVAIAV
PSLPIVLIGL AIVGFGMGFS MGTPLNYMML QNTEPEQSTS AIATLALVRQ VGTTIAPAIL
VGFVSASAGM AGYRNMLLAV ALFNAAAFIM LLFYRSSRSK APV