Gene Daro_1972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1972 
Symbol 
ID3570221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2121101 
End bp2122357 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content62% 
IMG OID637680443 
ProductMgtC/SapB transporter 
Protein accessionYP_285188 
Protein GI71907601 
COG category[S] Function unknown 
COG ID[COG3174] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones83 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.315083 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTCG TCGTTCCCGA ACTGGCTGCT CCCGTCGAGG CTTTTTCAAC GGCGCTCGGT 
ATCGGTCTGC TGATCGGCAT GGAGCGCGAA CGCCGTCCTG AAGCCTCGGC CGGCCTGAGA
ACCTTCGCGC TGGTCGCCAT GCTGGGCTGC CTCTTTGCCC TGCTTGGTGA CAAAACCGGT
GGTCCCTGGC TGCTGGTTGC CGGCCTGCTG GTCATTTCCG GCAGCATGAT CGCCTCGAAT
TTTTCGGCCC AGCAGGAAGA GCAATATCGG GGCTTTACCT CGGAAGCGGC GATCATCGTC
ACCTATGCGC TGGGCGCCGC CGTCTGGTTC GGCTATTCGA CGCTGGCTGT CATGCTCGCC
ATCACGACCA CCGTGCTCCT CTATTTCAAG GCCGAACTCC GGCAATTCAG CGAACGCACG
ACACCGAAGG ATATCAACTC CATCCTGCAG TTTGCCGTGT TGTCGCTGGT CATCCTGCCC
ATCCTGCCGA GCGAGGATTT CGGCCCCTAC AACGCGATCA ATCCGCGCCA GGTCTGGTAC
ATGGTCGTGC TGATCTCCGG GCTGGCACTG GCCGGCTATC TCGCCTTGCG CATCATCGGC
GCCCGCCACG GCGCGGCATT GCTTGGCATT TTCGGCGGCC TGGCGTCGTC GACCGCAACG
ACCATGATGT TCTCGCGTCA CGCCCGCGAC CATGTTCATC TGGTCCATAT GTCCGCTATC
GTGATCCTGA TCGCCAACGT GATGGTCATG ATCCGCCTTT GGCTGGTTGC CGGCGTGGTC
GCCCCCGGCT TGGCCACACC GATCGCCATC GTCTTCGCCT GCGGCATCGT TCCCGGGGTG
GCCATGTCGC TCTACGGCTG GAGGGTTTTA AGCGCGGCCG GCGAATTGCC GATGCCCGAG
GTCAAGAACC CGACAGAGCT GAAGACAGCC CTGTCCTTTG GTCTGCTTTA CGCCGTGGTG
CTGCTCGCCT CCGCCTGGTT GCAGGATATT GCCGGCAGCA GCGGCCTGTA TATCGTCGCC
CTGGTTTCCG GCCTGACCGA TGCGGACGCC AGCGTGCTGT CCACCCTGCG CATGTTCAAT
CTGGAAAAAG TCGCCAGTGG CGATGCCGTG ATCGCCGTCA CGCTGGCGCT GATGGCCAAC
CTGATCTTCA AGATCAGCCT GGTCATCAGC ATCGGCGGTG GCAAGCTGGC CCGCCATGCC
CTTCCCGGCC TGCTCGCCAT CGGTAGCGGC ATGGCCGTCG GTTTGATGCT CGTTTAA
 
Protein sequence
MSFVVPELAA PVEAFSTALG IGLLIGMERE RRPEASAGLR TFALVAMLGC LFALLGDKTG 
GPWLLVAGLL VISGSMIASN FSAQQEEQYR GFTSEAAIIV TYALGAAVWF GYSTLAVMLA
ITTTVLLYFK AELRQFSERT TPKDINSILQ FAVLSLVILP ILPSEDFGPY NAINPRQVWY
MVVLISGLAL AGYLALRIIG ARHGAALLGI FGGLASSTAT TMMFSRHARD HVHLVHMSAI
VILIANVMVM IRLWLVAGVV APGLATPIAI VFACGIVPGV AMSLYGWRVL SAAGELPMPE
VKNPTELKTA LSFGLLYAVV LLASAWLQDI AGSSGLYIVA LVSGLTDADA SVLSTLRMFN
LEKVASGDAV IAVTLALMAN LIFKISLVIS IGGGKLARHA LPGLLAIGSG MAVGLMLV