Gene Amir_5347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_5347 
Symbol 
ID8329549 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp6361708 
End bp6363498 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content76% 
IMG OID644945785 
Productvon Willebrand factor type A 
Protein accessionYP_003103013 
Protein GI256379353 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.23692 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTCGAC ACCGCACGCT GCGGACCAAG GTGCGGCGCG GCATCGCCGG GTGGCCCATC 
ACCATCATCG GCGTCGTGGC GCTGCTGGTG CTCGGCTGGT TCGGCTGGCG GTGGATCGGC
GACGTGGTCG ACCAGCGCGC GGCCGTGCAG GCCGGGGACT GCAACGAGGG CCCGGCGACG
TTGAAGGTCG CCGCGACCCC CAGCGTGGCG GACGCGGTGC GGCAGGTCGC GCAGGCGTGG
AGCGCGCAGC GGCCCGTGGT GTACGACCAC TGCATCGGCG TCGAGGTCCT CGCCAGCGAC
TCCGAGGTGG TCCTGGAGGG CCTGACGAAC ACCTGGGACG AGGAGAAGCT CGGTTCCCGG
CCGCACGCGT GGGTCACCGA CTCGGCGGTG TGGGCGAACC GGCTGGCCGC GCAGCGCCAG
TCCATGATCG GGTCCCCGCC GGAGTCGATC GCGACCAGCC CGGTGGTGCT GGCCATGCCG
CAGGAGGCGG CGGACGCGGT GCAGGCCGGG CCGGGGTTCC GGTGGACGGA CCTGACCGCG
ATGACCTCGT CGGCGACCGG CTGGGACCGG TTCGGCAAGG CCGGGTGGGG GGCGTTCAAG
GTCGCCATGC CCGACCCGGC GGTCAACCCC GGCACGGCCA TGGCGCTGGA GGCGGCGCTC
GCGGGCGCGG GCGCCGACCC GACGGGGCCG GTGACGGCGG ACCTGCTGGC GCAGGAGCCG
GTGAAGCAGG CGATGGCGAA GCTGGTCGCG GCGCGCCCGG AGCAGACGAC GACCAGCACG
TGGCAGGCCA TGGCGGTGCT CGCGGCGAAC CCGGCGGTCG GCTCGGTCGG GTTCAGCGCG
GTGCCCGCGC TGGAGGTCGA CCTGTACCGG CACAACACCG GCGCGGAGGA CAACCGCCCG
GCCCCGGCGA CGCCGCTGGC GGGGGTGGCC GCGCAGGGCG TGACGCCGGT GGCGGACTTC
CCGTTCACCG CGCTGTCGGG TGAGTGGGTG AACGAGGCGC AGGCGCGGGC CGCGCAGGCG
TTCCGGACCT TCCTGAAGGC CCCCGAGCAG CGGGCGACGC TGGCGGCGGC GGGACTGCGG
GTGGAGGGCG TGACCGAGCG GCCGAGCCCG GCGCCCGGCA TCGCGTGGGC CGAGGTGACC
GAGCAGCTCA AGCCCGCCGA CGCGGCGGCG ACGCAGCAGG TGGCGGGCGC GTGGGCGACC
GCCGACAACG GGCAGGTCGT GACCGTGCTG GTGGACACCT CGAAGACGAT GGGCGAGGAC
GGCGGCGACG GGCGCACCCG GCTGGAGTGG GTGCGGGAGG CGCTGACCGG GCAGGCGAAC
CGGGCGGTGT CCGGGTCGCT CGGGCTGTGG GAGTTCGCGA CCGGGGCCGA CGGGGACAAG
GCGTACCGGG AGCTGGTGCC GACCGGGTCG GTGGGGGCTC AGCGGCAGTC GCTGCTGGAC
GCGGTGGGAC GGCTCAAGCC GCGCGGCGAC GACCGGCCGT TCACGGCGCT GATCGCGGCC
TACGAGGACG TGCTGGCGGA CCACCGGGAC GGGAAGCGCA ACCGGATCGT GGTGATCACG
GACGGCGGGG CCGACGGGGA CCTGTCGCCC GCCGACGCGA AGGCGCACCT GGAGGGGCTG
AAGGTCGCGG GCAAGGACGT CGGGATCAGC GTGGTCGCGC TGGGGGGCGG CGCGGACGGG
CCGGGGCTGT TCCAGGACAT CACGAAGGCG TTCGGCGGCG GGACGGTGTC GGTGGTGGAG
GACGGGAGCG GCGTGGACGC GGCGCTCGGT CAGGTGCTGG CCGGGCGGTG A
 
Protein sequence
MSRHRTLRTK VRRGIAGWPI TIIGVVALLV LGWFGWRWIG DVVDQRAAVQ AGDCNEGPAT 
LKVAATPSVA DAVRQVAQAW SAQRPVVYDH CIGVEVLASD SEVVLEGLTN TWDEEKLGSR
PHAWVTDSAV WANRLAAQRQ SMIGSPPESI ATSPVVLAMP QEAADAVQAG PGFRWTDLTA
MTSSATGWDR FGKAGWGAFK VAMPDPAVNP GTAMALEAAL AGAGADPTGP VTADLLAQEP
VKQAMAKLVA ARPEQTTTST WQAMAVLAAN PAVGSVGFSA VPALEVDLYR HNTGAEDNRP
APATPLAGVA AQGVTPVADF PFTALSGEWV NEAQARAAQA FRTFLKAPEQ RATLAAAGLR
VEGVTERPSP APGIAWAEVT EQLKPADAAA TQQVAGAWAT ADNGQVVTVL VDTSKTMGED
GGDGRTRLEW VREALTGQAN RAVSGSLGLW EFATGADGDK AYRELVPTGS VGAQRQSLLD
AVGRLKPRGD DRPFTALIAA YEDVLADHRD GKRNRIVVIT DGGADGDLSP ADAKAHLEGL
KVAGKDVGIS VVALGGGADG PGLFQDITKA FGGGTVSVVE DGSGVDAALG QVLAGR