Gene Cpin_3535 details


Gene Information

Locus tag: Cpin_3535
Symbol:
ID: 8359702
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 4399944
End bp: 4401164
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 46%
IMG OID: 644965706
Product: major facilitator superfamily MFS_1
Protein accession: YP_003123200
Protein GI: 256422547
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
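
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon (the gene sequence in this record ends in TAA). The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS that ends with one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4399944, 4401164)
print(length)                           # 1221
print(expected_protein_length(length))  # 406
```

Both values agree with the Gene Length and Protein Length fields of this record.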


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.156699
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.255134
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTACAAA GTGCTTCAGA GAAAAGGAAA AGAATCGCAA CGACCTTAGC TTTTATTTCA 
ATTCCGCTAT CGGGATTTGT TACGGATATC TATTTACCAT CCTTCCCCTC CATGGCCAAA
GGCCTACAGG TCGAAGAAAG CAGCATCCAG CTCACACTGA CCTGTTTTTT CATCAGTTAC
GGTATCTCCC AGTTACTTGT TGGTAGTTTG CTGGACAGTA TAGGCAGATT TAAACCTGCC
ATGTATGCCC TGGTAGCCGT GATTATTACT TCCATGCTGA TTGGTTTTAC CAGTAATGTC
CAGTTGATCT GCCTCCTGAG GGTGATCCAG GGTATTGCCG TATCATTTGT AGTGGTGTCA
AAACGCGCCT TCTTCGTGGA CATGTACAGC GGAGATAAAC TAAAAAATTA CCTGAGCTAT
TTCACGATCA TATGGTCCTG CGGACCGATC ATCGCCCCTT TCCTGGGCGG CTATCTACAG
AAGTATTTTC ACTGGCAGGC CAACTTCTGG TTCCTGGCAG CATATGCCCT CGTGATGCTG
ATATTTGAAC TGATCTATAG TGGAGAGACC ATCAAAGCAC GTAAACAGTT TGATATAGCA
AGGGTTAAGA AGGACTATAG CATGGCCCTG GGTAACCTGA ACTTCGTGCT TGGTATCGTT
ACCCTCGGAC TAGCCTATTC TGTGGTAATG GTGTTCAATA TCGGCGGTCC CTTTGTTATC
GAAAATGGCT ACCATTTTAA CTCGATCGTT ACCGGCTATT GTACCCTCAT CCTTGGCTTC
TCCTGGCTGG TAGGTGGCGT CATAGGTAAG CGCCTGGTAG CAGTAGATTT TGTACGTAAG
ATCACCGTCG CTTCTGCCAC ACAATTACTC CTGATCGTCA TACTGATGTT CATCGGTTTC
GGATACCCTG CGCTGTGGTG CTTTATGTTA TTCGCATTTG TCATTCACAT CTGCTCCGGA
TTCCTGTATA ATGTGCACTT CACACAGAGC ATGCTGTACT TCCCGGAACA TGCAGCAATG
GCCTCTGGTT TACTGGGTGG GTCGGTTTAT GTGATCACCT CATTTTCCAG TTTCATTCTT
TCAAAGGCTG GTAAGATGGG ACATCAGCAG GATATCACAT TACGTTACCT GGTGGTCTCC
GCGATCCTGA GTGCAGTAGT ATGGTATGCA GTAAAGAGAA GAAGGACATT GCGAAAGGTC
AATACAGCCA TCGCAGCATA A
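
A GC percentage like the one in the GC content field can be recomputed from a sequence laid out in 10-base blocks as above. This is a minimal sketch, assuming GC content is simply the share of G and C among the A, C, G, and T characters. The function name `gc_content` is illustrative, and the database's own rounding rule is not known.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Whitespace and any other characters (for example the spaces between
    10-base blocks) are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence only; the full gene will differ.
print(gc_content("ATGTTACAAA GTGCTTCAGA"))  # 35.0
```

Pasting the full gene sequence into `gc_content` gives the value to compare against the record.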
 
Protein sequence
MLQSASEKRK RIATTLAFIS IPLSGFVTDI YLPSFPSMAK GLQVEESSIQ LTLTCFFISY 
GISQLLVGSL LDSIGRFKPA MYALVAVIIT SMLIGFTSNV QLICLLRVIQ GIAVSFVVVS
KRAFFVDMYS GDKLKNYLSY FTIIWSCGPI IAPFLGGYLQ KYFHWQANFW FLAAYALVML
IFELIYSGET IKARKQFDIA RVKKDYSMAL GNLNFVLGIV TLGLAYSVVM VFNIGGPFVI
ENGYHFNSIV TGYCTLILGF SWLVGGVIGK RLVAVDFVRK ITVASATQLL LIVILMFIGF
GYPALWCFML FAFVIHICSG FLYNVHFTQS MLYFPEHAAM ASGLLGGSVY VITSFSSFIL
SKAGKMGHQQ DITLRYLVVS AILSAVVWYA VKRRRTLRKV NTAIAA
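
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below uses the amino-acid assignments shared by the standard genetic code and translation table 11, which differ only in the set of permitted start codons. The names `CODON_TABLE` and `translate` are illustrative. Alternative start codons are not handled, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codons enumerated in TCAG order, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.upper().split())  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence in this record.
print(translate("ATGTTACAAA GTGCTTCAGA GAAAAGGAAA"))  # MLQSASEKRK
print(translate("AATACAGCCA TCGCAGCATA A"))           # NTAIAA
```

Both fragments match the first and last residues of the protein sequence listed above.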