Gene Amir_1954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_1954 
Symbol 
ID8326139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp2162777 
End bp2164489 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content73% 
IMG OID644942503 
ProductLPXTG-motif cell wall anchor domain protein 
Protein accessionYP_003099748 
Protein GI256376088 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.614483 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCGCACT ATCGGTTCAT GCGTTTGAGT CGACTCTCGG TAGTTGCGGG TATTGCCACC 
GGTCTGGTTG TCACGACGGC CACCGTCGTC CTGGCCAGCG CCCAGGCCAG CCCCTCCCCC
GCCACGGGCC CCGGCCCGTG CGTCGGCGCC GAGTGCCCGG CCGAGTACGC GCCGGTCCGC
GGCAACAGCG GGCGGTTCGA CGGGCGCGAC GAGGCGGTCA ACGTGTTCGT CGGCAAGTCG
TTCACCGTGT CCGGCAACGC GGCCGAGGCC GAGGGCAGGC TCGTCGTCGG GGGCTCGTTC
ACCCTGGCGA AGGACGGCAG CGGCGTCGGC TACAACGTCG GGACCGTGGG CGCGGGCTCG
CAGGTGCCGC CGCCCGCGGG CTCGGACTTC CTGGTCACCG GCGGGGACCT GACCATCGCG
CCGGGCCAGG AGCTGCGCGC CGACGGCGGT GTCGTGCGGT ACGCGGGCAC CAAGACCGGC
GTGGTCAGCA GCACCGGCGC CGCCGTGCAG GACGACAACG CCTTCGCGCC CTACGCGGGC
ATCGGCGAGG CGCTCCGCGA GGACAGCGCC TGCTACGCCG CGCTCCCCGC CACCGGGACG
GTCACCCGCG ACGACCTCGC CACCACGTTC ACCGGCGACG GCGTGTCCGC GCTCCAGGTC
TTCACCCTGG CGGGCGACAT CACCGGGGCG AACGGGTCGA TGCAGGGCAT CGAGTTCGTG
GGCGTCCCGG ACGGCGCGAC CGTGCTGGTC AACGCCACCG GTTCCGCGCC CCGCATCACC
AGCTGGTCCG GCACGCACAA CAACCGCGAC GGCATCGACC GCCTCGGGCA GCGGCTGCTG
TGGAACTTCC CGAACGCCAC GACCGTGACC CTGAACGGCC AGTCCGAGTT CCAGGGCAGC
GTGGTCATCC CCCGCCAGGA CAGCACCGCG AAGGTCAGCA CCCCCGGCTT CAGCGGGCGG
TTCTTCACGG CGGGCTCGCT GGAGCACGGC GGCAACGGCA GCGGCGACGG CAACGAGTTC
CACGCGTACC CGTTCACCGG CGTGATCCCG ACCTGCGGCA CCGGCACCAC GACGGTCCCG
AGCAGCAGCA CGACCAGCAG CACCAGCACG ACGACGGTCG AGACCACGAC GTCGAGCAGC
ACCACGACCA CCGCGCCCAG CACCACGACG ACCGCGCCGA CGACCACCAC CGCGCCGAGC
ACCAGCACCA CCACGAGCAC GACCGTCGAG ACCACGACCA CGACGCCGTG CGAGGAGACC
ACCCCGGAGA CCAGCACCAC CACCACGGTC CCGACCACGA CGACCGCGCC GAGCACCAGC
ACGACGACCA GCACCACGAC GGTCGAGACC ACCCCGGAGT CGAGCACCCC GGAGACCAGC
ACCAGCAGCA GCTCCACCAC GACCACCGCG CCGTCGACCA CGTCCTCGTC CTCCTCGGCG
ACGACCTCGT CGACCACCTC CGCGACGACC TCGTCGTCGT CGACGTCCGA GACCACCTCG
GCGACGACGT CGACCAGCAC TGAGGCGTCG AGCACCACCG CCACGAGCCC GAGCACCTCG
GAGAACCCGG TCGTCCCGGC CGTCGCGAAG ACCTCGGGCG GCGCGGGCCT GGCCCACACC
GGCTCCCCCG CCGGGATGGC GCTGGCCATC GGCGCGCTCC TGCTGATCGG CGGCGCGGCG
CTGTTCGCGG TGACCCGCCG CCGCAAGGTC TGA
 
Protein sequence
MPHYRFMRLS RLSVVAGIAT GLVVTTATVV LASAQASPSP ATGPGPCVGA ECPAEYAPVR 
GNSGRFDGRD EAVNVFVGKS FTVSGNAAEA EGRLVVGGSF TLAKDGSGVG YNVGTVGAGS
QVPPPAGSDF LVTGGDLTIA PGQELRADGG VVRYAGTKTG VVSSTGAAVQ DDNAFAPYAG
IGEALREDSA CYAALPATGT VTRDDLATTF TGDGVSALQV FTLAGDITGA NGSMQGIEFV
GVPDGATVLV NATGSAPRIT SWSGTHNNRD GIDRLGQRLL WNFPNATTVT LNGQSEFQGS
VVIPRQDSTA KVSTPGFSGR FFTAGSLEHG GNGSGDGNEF HAYPFTGVIP TCGTGTTTVP
SSSTTSSTST TTVETTTSSS TTTTAPSTTT TAPTTTTAPS TSTTTSTTVE TTTTTPCEET
TPETSTTTTV PTTTTAPSTS TTTSTTTVET TPESSTPETS TSSSSTTTTA PSTTSSSSSA
TTSSTTSATT SSSSTSETTS ATTSTSTEAS STTATSPSTS ENPVVPAVAK TSGGAGLAHT
GSPAGMALAI GALLLIGGAA LFAVTRRRKV