Gene Daro_3551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3551 
Symbol 
ID3567629 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3803227 
End bp3804870 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content62% 
IMG OID637682024 
Productglycosyl transferase family protein 
Protein accessionYP_286750 
Protein GI71909163 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value0.141392 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.701436 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGCCA GACCAATCCT TCGTTCCATC GAGCGCTTCC TGCTGTCGCC GTTCAGCCTG 
GTGCTGGCGA TCGTCATCGC TTTCGTGCTC AATGCCTACA GCCTGCCGCT GACCGATGTC
GACGAAGGGG CGTTTTCCGA AGCGACGCGC GAAATGATGG CGCGTGGCAA CCTGATCTCG
CCGACCCTGA ACGATGCGCC GCGCCACGAC AAGCCGATCC TGATCTACTG GGCGCAGGCT
GTTTCAGTCA GTGTGCTCGG GGTCAGTGAA ATCGGCTTCA GGTTGCCGTC GATCATCGCC
AGCATCCTTT GGTTGCTCTT CCTCTACCGC TTCTGTCTGC ACCATGCCGA CCGGCGGACG
GCGCAGGTCG CTGCGCTGGT CATGGCCTTG TCGCTGGCCG TTGGCTTCAT CGCCAAGGCC
GCGATTTCGG ATGCGCTGCT CAACCTGTTC ATCGCCCTGT CGATGTTCGG CATCTACGAC
TTCTTTTGCT CCAGCCGGGA TGGCAAGAGC CCGCAACAAA TCCGCCGGCT CCTGTTCTTC
GTCTATGCCA TGCTCGGCCT CGGCTTCCTG ACCAAGGGGC CGGTCGCGGT GATGTTCCCG
CTGTTGATCA GCGGTATTTT CTTCGTCTCG GCCGGCGCCT GGCGTGACTG GCTGAAAGCG
GCCTTTTTCT GGCCGGGCTG GCTGCTTTTC CTGGCCATCG TCGTGCCCTG GCATGTGCTG
GTTTATCTGG ATCAGGGCGA CGCCTTCTTC CGCGGTTTCT ACCTGAAGCA CAACCTGAAT
CGCTACTCCA ATACCTTCGA GGGGCATGGC GGCAAGTGGT ACTACTACCT CGTCGTCCTG
CCCTTCGTGC TGATGCCGTT CACCGGCTGG CTGCTCGCCA TCGTCGGCCA GCTGCTCAGG
AAGCTGTGCT CGGTGCCGCT CGGCCTGGCT TACGGCGAGG GACTGTTCGA ACGCTACCTG
ATCCTGTGGT TCGTCGTCGT CTTCGGCTTC TTCTCCTTCT CCGGGACGCA GTTGCCGCAC
TACCTGCTCT ACGGCTGCAC GCCGCTCTTC ATCCTGCTCG CCCGCTATCG TCCGGAGTTC
GAGCGGCGCT GGCTGGCCTA TCTGCCGCTG ATCGCCTTTA CCCTGTTGCT GGCTGCCTTG
CCGGAAATTC TGGTCTTTGC TGCCGCCAAG GCGGGCAAAC CTTTTGAGAA GACGCTGCTC
GACGGCCTGG TTGCGGCCTT CTCCGACGAA GCGCGCTGGT TGCTCCCCTT GCTCGCCGTT
GCCGTCATTG CATTGACCTT GTGGCGCAAC TTGCCGGTCT GGCAAGGGCT GATCATTGCC
GGGCTACTGC AGGCACTGGT CGTTGGCGCG GTCATCGCGC CGCGAGTGAT CGGTGTCACC
CAGGGGCCGA TCCGCGAGGC GGCCATGGTC GCCAAACAAA GTGGCGGCAC GGTGGTGGCC
TGGCGCATCA TCATGCCCAG CTTCAGCGTC TATCGGCAGT CGGCGACGCC AACCAGGGTG
CCCGTTGAGG GAGAGCTGGT GATCACCCGT ACCGACCGCC GGCATGAGGT GCAGGCGCTG
CTGGCTTCCG GTCTGACGAT GCGCGATATC TACGTGAAGA GTTTCGTGAC CCTGGCTCGC
GTCGAACGCG AGGCGGCGCG GTGA
 
Protein sequence
MQARPILRSI ERFLLSPFSL VLAIVIAFVL NAYSLPLTDV DEGAFSEATR EMMARGNLIS 
PTLNDAPRHD KPILIYWAQA VSVSVLGVSE IGFRLPSIIA SILWLLFLYR FCLHHADRRT
AQVAALVMAL SLAVGFIAKA AISDALLNLF IALSMFGIYD FFCSSRDGKS PQQIRRLLFF
VYAMLGLGFL TKGPVAVMFP LLISGIFFVS AGAWRDWLKA AFFWPGWLLF LAIVVPWHVL
VYLDQGDAFF RGFYLKHNLN RYSNTFEGHG GKWYYYLVVL PFVLMPFTGW LLAIVGQLLR
KLCSVPLGLA YGEGLFERYL ILWFVVVFGF FSFSGTQLPH YLLYGCTPLF ILLARYRPEF
ERRWLAYLPL IAFTLLLAAL PEILVFAAAK AGKPFEKTLL DGLVAAFSDE ARWLLPLLAV
AVIALTLWRN LPVWQGLIIA GLLQALVVGA VIAPRVIGVT QGPIREAAMV AKQSGGTVVA
WRIIMPSFSV YRQSATPTRV PVEGELVITR TDRRHEVQAL LASGLTMRDI YVKSFVTLAR
VEREAAR