Gene Daro_3478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3478 
Symbol 
ID3566938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3726071 
End bp3727591 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content60% 
IMG OID637681950 
Producthypothetical protein 
Protein accessionYP_286677 
Protein GI71909090 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.153576 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGAGA CTTCATGCCC CCCAGCCACC AGCACTTTGC TCGCTCCTAA AAAAATGGGC 
ATGAGCTTAC CAACCAGGCC ACCAGGCCAG ATCGGCACGC TGCTGGCCCT GTTCATCGCC
GTCCTTGCCT GGCGGGCGAT TGCCTTTTCG TTATCGAACG CGACCTTGTA TGTGGACGAA
GCGCAATACT GGTTCTGGTC GCAACATCTG GCGTGGGGCT ATTTTTCCAA GCCACCGGGC
ATCGCCGCCC TGATTCATCT GTCGACGGCA CTGTTTGGCG ATGGCCCGCT CGGCGTCAAG
GCGCTGACCA TGCTGTGCTA TCCGTTGAGT GCGCTGATCT GTTGGCTGAT CGCCCAGCAC
CTCTACGACG CCTCGACTGC TTTCTGGGCT GCCATCGCCG CACTCACGCT GCCCATCTAT
TCCTGGCTTG GCCTGTTCGC TTCGACTGAT GCGCCATTGA CGTTACTCTG GCTGCTCGGC
CTGTGGTTTT ATCTGCGCGC CATTGAACAT GGACGCTGGA TGGATTGGCT GATGCTTGGC
GCGGCCTGCG GATTGGGATT GCTGTCGAAA TACACGATGG CAGTATTCAT CGCTGCGCTC
TTCCTGCACC TGCTCTGCTT TCACCGCACT TTCCTGACCA GCGCCAAGCC CTGGGCGGCT
GCCGGACTGA GCCTGGCATT GCTTGCGCCC AACCTGCTCT GGAACCTTGC CAACGATTTC
CCAACACTTC GCCATACCGC CGATATCACG CTGAATCGCC ACAACGGCGG TGGCTTGAAA
TCGCTGGCGG AGTTCTGGGC GGCCCAATGG ATCAGCTTCG GACCGCTACT CGGCAGTGTC
GTCGCACTGA TCCTGTTCCG TTTTCGCGAG ACCTGGCGCG ACACACCAGC CCGCCTGCTG
CTCTGGTTCT CGCTGCCGCT GTGGGCCGTC GTCTCGGTGC AGGCGCTTCA AGGCAGCGCC
AACGCCAACT GGGCGGCACC CGCATTCGGG CCGATGGCCA TCCTGCTGGT CGCCTGGTTA
CGCCAGCGCG ACCAGCACAA ATGGCTATTG ACCGGGGTCG CCACCAACTT CGCCCTCATC
GGCGTGATCT ACCACGCCCC CGGCCTGCTG GCAGCCGCCA ATGTAAGCAG CCAGGCGAAA
CTGAACCCGT TTATTCGTGC AACTGGCTGG GATGAGCTCG GACAGCAGCT TCGCCCCCTC
GTACAGACCC ACCCCAATGC CGTGCTGATC GCGAACAACC GCACGCTGCT CGCGCACATG
GCTTACGAAC TGCATGGCCA GCAGCCGCGC ATTGCCAGCT GGAACCCGGA AGGCGTGGCC
AGCGACCACT TCAAATTGAC GATGAAGCTC GACGCTCACC GTGGCGGCGA TGCGCTGTTG
CTGACCGAGG CTGCACCAGA CCAGGAATTC ACCGAAAGGT TCACGCACGT CGAAAAGCTG
GCCTCGCTGG CAGCGCCACT CGACACAATC AATTCACGCC ATATCGAGGT TTATTTACTC
CATGAATTCC AGGGATATTG A
 
Protein sequence
MFETSCPPAT STLLAPKKMG MSLPTRPPGQ IGTLLALFIA VLAWRAIAFS LSNATLYVDE 
AQYWFWSQHL AWGYFSKPPG IAALIHLSTA LFGDGPLGVK ALTMLCYPLS ALICWLIAQH
LYDASTAFWA AIAALTLPIY SWLGLFASTD APLTLLWLLG LWFYLRAIEH GRWMDWLMLG
AACGLGLLSK YTMAVFIAAL FLHLLCFHRT FLTSAKPWAA AGLSLALLAP NLLWNLANDF
PTLRHTADIT LNRHNGGGLK SLAEFWAAQW ISFGPLLGSV VALILFRFRE TWRDTPARLL
LWFSLPLWAV VSVQALQGSA NANWAAPAFG PMAILLVAWL RQRDQHKWLL TGVATNFALI
GVIYHAPGLL AAANVSSQAK LNPFIRATGW DELGQQLRPL VQTHPNAVLI ANNRTLLAHM
AYELHGQQPR IASWNPEGVA SDHFKLTMKL DAHRGGDALL LTEAAPDQEF TERFTHVEKL
ASLAAPLDTI NSRHIEVYLL HEFQGY