Gene Caci_4723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4723 
Symbol 
ID8336077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5386679 
End bp5387920 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content74% 
IMG OID644957823 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003115425 
Protein GI256393861 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0900603 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCGTGA CCACAGTCGA ACATCAGGGA CACCTGCCCG GTACCGCCGA ATACCGCCGT 
GTCCTCACCG CCCTGTTCGC CGCCGGCATC GCCACCTTCG TCCTGCTCTA CGACACCCAG
GCGCTGCTCC CCGACCTCGC GCACGCCTTC CACGTCTCCC CGGCCGAAAG CACGCTCTCG
GTCTCGGCGA CCACCGTCGG CCTGGCCGTG GCGCTGCTGG TGTTCGGACC GCTGTCCGAA
GCGCTGGGTC GCACGGTCCT GATCCGCTTC TCCATGGCCG CCTCGGCGGT GCTGGCGCTC
GCCTGCGCGG CGGCTCCCAC CTGGGACTCG CTGATCGCGA TCCGGCTGCT GGCCGGGGTG
GCGCTCGCGG GTCTGCCGGC GGTCGCCACC GCCTACCTGC GCGAGGAGAT GCACCCCTCG
GCGCAGGCGC GCGCCGCAGG GCTCTACATC GGCGGGACCG CGATCGGCGG GATGGCGGCG
CGGCTGGTCA CGGCGCCGAT CGCCGAGGCG GCGGGGTGGC GGTGGGCGCT GCTGGCCGCT
GCGGCGCTGT CCACGGCGTG CGCGGTCGTC GTCGCGGTCA CCCTGCCGCC GTCGCGCCAC
TTCGTGGCGA CCCGGCTGCG CGGCCGGACC GTGCTCGCGA TGCAGCGCCG GGCCCTGGCG
GACCCGGCGC TGCTGGCTCT GTACGCGCTC GGCGCCTGCG CGGTCGGCGC GCTGGTGGCG
GTGTTCAACG CGGTGGGCTT TCGGCTCACC GCCGCGCCGT TCCACCTGGG GGTCGGGCTG
GTGAGCCTGA TATTCCTGAC CTACTCGCTG GGGACCGTCA GCTCGACGGT GTCCGGGCGG
CTGGCCGACC GGCTCGGGCG GCGGGCGATC GCGCCGATCG GGTGCGCGGT CGCCTTCGGC
GGGGTACTGC TGACGCTGAC CGGCTCGCTG CCGGTGGTGA TCGTCGGGAT CGCGGCGCTG
ACCGTCGGGT TCTTCGCCGT GCACGGCCTG GCCAGCGGCT GGGTGACGGC GCGCTCGCAC
GCCTCCGGCG CCAGTCCCAG CCAGGCCGCG GCGTTCTATC TGTTCTCGTA CTACGTCGGC
TCGTCGGTCT TCGGCAACAT GGGCGGCCGG GCCTGGTCGG CCGACGGGTG GCCGGGCGTG
GTCACGGTAG CGGGCTCGTT GCTGGGGATC GCTGGAGTAC TAGCGCTGGC GCTGCGTCGG
ATCCCGCCGC TGGTCCCGCC GTCGCCGGCG GCCGTGCCGT GA
 
Protein sequence
MPVTTVEHQG HLPGTAEYRR VLTALFAAGI ATFVLLYDTQ ALLPDLAHAF HVSPAESTLS 
VSATTVGLAV ALLVFGPLSE ALGRTVLIRF SMAASAVLAL ACAAAPTWDS LIAIRLLAGV
ALAGLPAVAT AYLREEMHPS AQARAAGLYI GGTAIGGMAA RLVTAPIAEA AGWRWALLAA
AALSTACAVV VAVTLPPSRH FVATRLRGRT VLAMQRRALA DPALLALYAL GACAVGALVA
VFNAVGFRLT AAPFHLGVGL VSLIFLTYSL GTVSSTVSGR LADRLGRRAI APIGCAVAFG
GVLLTLTGSL PVVIVGIAAL TVGFFAVHGL ASGWVTARSH ASGASPSQAA AFYLFSYYVG
SSVFGNMGGR AWSADGWPGV VTVAGSLLGI AGVLALALRR IPPLVPPSPA AVP