Gene Daro_0358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_0358 
Symbol 
ID3569831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp390481 
End bp391683 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content64% 
IMG OID637678800 
Producthypothetical protein 
Protein accessionYP_283587 
Protein GI71906000 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value0.23836 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000671481 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAACTCAC AGAGAGAACG CCTCAAGGTG CTGAGCGCCG GGATATTCAG CCTGCTGCTG 
ACCTTTGGCG TGGCCCGCTT TGCCTACACG CCCTTGCTGC CGATCATGCA GCAGCAGGCC
GGGCTGGGGC TGGCCGAAGC CGGCTGGCTG GCGGCGCTCA ACTACGCCGG TTATCTCAGT
GGCGCACTGA TTGCCTCGCT GATCAGCAAC CTGGTGCTCA AGGACAAGCT GTACCGGATC
GGCCTGGTGG TCGCCATCCT GAGCACGGTG ATGATGGGGC TGACCACCGA CCCGCTGCTC
TGGATGGCCT CGCGCTTCAT CGCCGGCCTC TCCAGCGCGG CCGGCATGCT GCTCGGCACC
GGGCTGATCC TCAACTGGCT GATTCGCCAC AACCACCGGC CGGAGTTGGG CATCCACTTT
GCCGGCATCG GGCTGGGCAT TTCCGGTTGT GCCGTGGCGG TGTGGCTGAT GGGCGGCTGG
CTGGACTGGC GCGAGCAGTG GTTCGCCTTT TCGGCCATTG CTTGTCTGCT CATTGTGCCG
GCCATGGCCT GGTTGCCGGC GCCCGATACC AGTCCGGTGA CGAAGAGCGG CGTCACCATG
CACGACAATC CGCCGAGCCC GCTGTTCCTG CGCATCTTCA TGGCAGCCTA CTTCTGTGCC
GGTTTCGGCT ATGTGATCAG TGCCACCTTC ATCGTCGCCA TCGTCAATGG CCTGCCTGGT
CTGGCCGGGC AAGGCGGGCT GGCCTTCCTG GCCATCGGTC TGGCCGCTGC GCCCGCCGCC
TTCAACTGGG ATCTGATCGC CCGCTACACG GGGGACATCA ATGCCCTGAT ACTCGCCGCC
GTGCTGCAGA TATTCGGCAT TGTCCTGCCG GTGGCGGTCG GTGGGCTGAT TCCGACGATC
TTTGGTGCGT TGTTGTTCGG TGGAACCTTC ATCGGCATGG TGAGTCTGGT CTTGACCATG
GCCGGGCGCT ACTACCCGAC CAAGCCGGCC AAGATGATGG GCAAGATGAC GCTCTCCTAC
GGCGTGGCGC AGATCATCGG GCCGGCCATC GTTGGCTGGC TGGCCACCCG GCTCGGTAAC
TATTCGATTG GCCTGTATAT CGCGGCCGGC GTGATGGTGA TGGGCGTCGT GCTGTTGGTT
ATACTGAAAC TGGTGGAAAA GCGGGACGCC ACGCTGGCGC TTGAGCCCAG CTTGCAGAAC
TGA
 
Protein sequence
MNSQRERLKV LSAGIFSLLL TFGVARFAYT PLLPIMQQQA GLGLAEAGWL AALNYAGYLS 
GALIASLISN LVLKDKLYRI GLVVAILSTV MMGLTTDPLL WMASRFIAGL SSAAGMLLGT
GLILNWLIRH NHRPELGIHF AGIGLGISGC AVAVWLMGGW LDWREQWFAF SAIACLLIVP
AMAWLPAPDT SPVTKSGVTM HDNPPSPLFL RIFMAAYFCA GFGYVISATF IVAIVNGLPG
LAGQGGLAFL AIGLAAAPAA FNWDLIARYT GDINALILAA VLQIFGIVLP VAVGGLIPTI
FGALLFGGTF IGMVSLVLTM AGRYYPTKPA KMMGKMTLSY GVAQIIGPAI VGWLATRLGN
YSIGLYIAAG VMVMGVVLLV ILKLVEKRDA TLALEPSLQN