Gene Sare_2621 details


Gene Information

Locus tag: Sare_2621
Symbol:
ID: 5703877
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2985991
End bp: 2987604
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 71%
IMG OID: 641272082
Product: cell wall anchor domain-containing protein
Protein accession: YP_001537452
Protein GI: 159038199
COG category:
COG ID:
TIGRFAM ID: [TIGR01167] LPXTG-motif cell wall anchor domain
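
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the record states neither explicitly. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2985991
end_bp = 2987604

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a stop codon that is not counted as a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1614 0 537
```

Under these assumptions the result agrees with the listed gene length (1614 bp) and protein length (537 aa).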


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGTCG GCGCACCCGG CCTCGCCCTG GCGATCCTGG TGCTGGCCGC AGGGGCCGGG 
CCGGCCATCG CCGAGCCCGC CGCGATCACC GCGACCGGCG CGGACCTGGT CTCGGTGGCT
CTTGATGGTG ACGCGATGTC GCTGCGGATC CGAGACGCCG CCCAAGTGGC CCGCGAGGCA
CCAGGGCACG ATCCGGCCGA GTTCGTGCTC GGCCCCGACG GCGGCCTGGC CGGCCGGGTG
CCGGCCGGAG AAGCCTTTGC CTTCCTCGGC CCGCCGGGCC AGCCGGTGTG GTCGCTGTCC
GCCGGCGACA CCGGGTTCCC GGCCCTGGAC ACCACTGGAG TACGCCCCGG CGTCCTCGAC
GACGGCATGG TGACCCTCAG CCTGCGTTCG ATCGACGGGC CGGGAACGTT CACCGCCTAC
AGGCTGTCGA GCATCGGGCG GGTGACTGCA TTGTTCGGCA GCGGTACGAA TGTGACGCGG
TCTGTTCAAC TAGCGGCGGC AACGCGCACC GGTGGTGTGG TGTGGGCTTT CGACGCCGCT
GGCGACTATC GGCTCACCCT GGCCGCATCG GCGGGCCTGC ACTCCGGTAA GACGGTTAGC
GCTGAGGCGA CATATCGGAT CCGCGTACCC GCGATCATGC CGCCCGGACA GATGCTGCCA
TCGGCGGCAC CACAGCAGAC AACGCGGCCC GACGGCCACC CGACGGTACA GACGTTCGCT
GCTCCCGCTG CCGAACCGAA GCTCGCTGCC GAACCGAAGC CCGCCGCGGA ACCGAAGCCC
GCCGCCGCGC CGGCGGCACC GGCCGCGAGG GTGGCGGCCG CCACCAGCAA GGGCGTGCGG
CACGTGATCG CCGATGGGCA CGTCGACATG GGCCCGCAGC TGTCCGGAGA CACCTGGACG
ATCCGGATCA AGGACGACCG AAGCAGCCCC GCGGTGTGGC GGGAAACCGC TGACGTGGTC
TTGCACATCA AGGACAACGC GAAGATCACC GTGCCTGCCG GCGCGGACTT CCTCGGTAGA
CAGGGCGACA CGGTGTGGCT GCTCCCGCAG TCCCAGCAGG CCGGCATCGT CTGGCCAGGC
TGGAACACCC AGCACCAGTC CGTCGTGTCC GGCGTCAAGG GCAACGTCAC CTGGACGCTC
CGGGGCGTCA ACGGGCCGGG CCGGTTCGCT CTGTTCCTGA CCGGCTCGTT CGGCAAGGCC
GACGTGCTGT TCGACTCCGC CAAGTCGTTC CCGCAACAAC TGGCTGTCCC GCTGAACACT
CACGCGCACG GGAACTGGGC GTTCACCAAA CCCGGCCTGT ACCGCCTCGC GGTGCAGATG
AGCGGCACCA CCACCGCCGG CAAGGCGGTC ACCGACACGA AGACGCTCAC CATCGCCGTT
GGTGACAGCA CCGACCCGAC GGTCGGCTTC GGACCGGGCA GTGCTTCCGA AGGCGGCGGG
GAGAACAACG GGAAGGACCA GGGTGGTACA GGCCCGCTGC CGCGTACCGG TGTTGGCTGG
GTGCTGTCGG CCGGCGCGGC CGGCATGGGC CTCGTCGCCG CCGGGGTCTT GCTGGTGCTG
CTCGCCCGCC GCCGCTGTAC CGGCCCCGCT GACCGCGCAG TGGGGAACCA GTGA
 
Protein sequence
MRVGAPGLAL AILVLAAGAG PAIAEPAAIT ATGADLVSVA LDGDAMSLRI RDAAQVAREA 
PGHDPAEFVL GPDGGLAGRV PAGEAFAFLG PPGQPVWSLS AGDTGFPALD TTGVRPGVLD
DGMVTLSLRS IDGPGTFTAY RLSSIGRVTA LFGSGTNVTR SVQLAAATRT GGVVWAFDAA
GDYRLTLAAS AGLHSGKTVS AEATYRIRVP AIMPPGQMLP SAAPQQTTRP DGHPTVQTFA
APAAEPKLAA EPKPAAEPKP AAAPAAPAAR VAAATSKGVR HVIADGHVDM GPQLSGDTWT
IRIKDDRSSP AVWRETADVV LHIKDNAKIT VPAGADFLGR QGDTVWLLPQ SQQAGIVWPG
WNTQHQSVVS GVKGNVTWTL RGVNGPGRFA LFLTGSFGKA DVLFDSAKSF PQQLAVPLNT
HAHGNWAFTK PGLYRLAVQM SGTTTAGKAV TDTKTLTIAV GDSTDPTVGF GPGSASEGGG
ENNGKDQGGT GPLPRTGVGW VLSAGAAGMG LVAAGVLLVL LARRRCTGPA DRAVGNQ
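
As a rough illustration of how the gene and protein sequences relate, here is a minimal sketch that translates a nucleotide string codon by codon and computes its GC fraction. It is not the database's own method. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard one, whose amino-acid assignments translation table 11 shares (table 11 differs in the set of permitted start codons, which this sketch does not handle).

```python
from itertools import product

# Amino-acid assignments in NCBI codon order (bases T, C, A, G).
# Translation table 11 uses these same assignments; only the set of
# allowed start codons differs, which is not modelled here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Strip the spaces and line breaks that group bases into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 60 bases of the gene sequence listed above.
first_60 = "ATGCGCGTCG GCGCACCCGG CCTCGCCCTG GCGATCCTGG TGCTGGCCGC AGGGGCCGGG"
print(translate(first_60))  # MRVGAPGLALAILVLAAGAG
print(round(gc_fraction(first_60), 2))
```

The translated fragment matches the first 20 residues of the protein sequence above. Passing the full gene sequence to the same functions would be one way to check the listed protein and GC content.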