Gene Sare_1086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1086 
Symbol 
ID5704077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1223122 
End bp1224822 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content68% 
IMG OID641270601 
Productcell wall anchor domain-containing protein 
Protein accessionYP_001535985 
Protein GI159036732 
COG category[D] Cell cycle control, cell division, chromosome partitioning
[Z] Cytoskeleton 
COG ID[COG5184] Alpha-tubulin suppressor and related RCC1 domain-containing proteins 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.628609 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.014892 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGGAT TCTGTCAACG GCTGGGCTGG GACGTCGGAC GGCATCAGGC GGGACGCACC 
GCCGCCGGAT GGTTCTCGCT CGCGCTGGCC GTGACGGTCA CGCAGGCAAC CGTCGTCTCG
CCCGCGTTCG CGCAGCACAA GCCCCCGACG ACCGCCGCCG TGACGGACAC CGCACTCGCC
TGGGGCGAGA ACGACGAGGG GCAGTTGGGC AACGGAGGCA CCACCAACAC CAATGAACCG
ACTGCGGTCA GCCTGCCCTC GGGCACCCTG GTCACCGCCA TCGCCGGTGG CGACGGCCAC
AGCCTGGCGT TGACCTCCAC CGGCAGTGTG CTCGCCTGGG GCGACAACTC CGATGGTCAA
CTCGGCGACG GAACCACCAC CGACACCACC ACACCTGTCG CGGTAGACCT GCCCACGGGA
ACCGAGGTAA CCGCCATCGC CGCCGGCAAC GACCACAGCC TGGCGTTGAC CTCCACCGGC
AGTGTGCTCG CCTGGGGCGA CAACTCCGAT GGTCAACTCG GCGACGGCAC CACCACCGAC
ACCAGCACAC CCGTCACCGT GGACCTGCCC ACGACCACCA CGGCCACCGC CATCTCTGCC
GGTGCCGACT ACAGCCTGTC ACTGTCCTCC ACCGGCGGCG CCTTCGCCTG GGGGAACAAC
GACCAAGGCC AACTGGGCGA CGAGACCACC CTCAGCACCA GCACACCCGT CAACGTCGCC
CTGCAACCGG GCACCACGCT CCTCGCCGTC GCCGGAGGTT CCGGCCACAG CCTGGCGATA
ACCTCAGACA ACGCCGCCAT CGCCTGGGGG GACAACTCCC AGGGTCAACT CGGCGACGGC
ACCACCACCG ACGCCCTCGC GCCCGTCAAC GTCGCCCTGG CACCGGGAAC CGAGATCACC
GCCGTCGCCG CCGGGCGCCT CCACAGTGTG GCGTTGACCT CCGCCGGCAC TGCCTTCACC
TGGGGCAACA ACGCCTCGGG CCAGCTGGGC AACGGGACCA ACACCACCAG CAGCACTCCG
GTCGCGGTCA GCCTGCCCAC CGGCACCACG CTCACCGCCA TCGCCGCCCA CAACAGCAAC
CACACCGTGG CGATCACCAA CACCGAAACC GCCCTCGCCT GGGGCGACAA CTCCTTCGGT
CAACTCGGCG CCGAGATCAC CATCACCAGT AGCAGCAACA CACCCATCCC GGTCAACCTG
GCCGCCGGCA CCACGGTCAC GACCACGGCC GTCGGCAACA ACCACAGCCT GGCCCTGCCC
ACACTGCAAC CAAGCTCCAC CACGAACCTG AACGTCTCAC CCCCGGACCC GACAGCAGAT
CAGGACGTCA CCCTCACCGC CACCGTCACC TGCAACATCG ACACCCCCAC CGGAACCATC
ACCTTCCGCA ACAACAACAC CGACCTCGCC ACCGTGCCCC TGGACAGCAA CAACACCGCC
ACCCACACCA CCCGACTCCC ACCCGGCACC CACACCCTCA CCGCCCACTA CACCAGCACC
AACACCTGCC CCAGCGGCCA ATCCGAATCC ACCACCATCA CCATCACCGC ACCCGACAAC
CCCAACACGC CCGACGACCC CGACCTACCC ATCACCGGAC CCAACCTGCC CACCATCCTC
GGCACCGCCA CCCTGCTCAT CCTCGCCGGC GCCGCATTCC TCTTCCTTAC CCGCCGCAAC
CGAACAACAC ACCAGAAATA G
 
Protein sequence
MQGFCQRLGW DVGRHQAGRT AAGWFSLALA VTVTQATVVS PAFAQHKPPT TAAVTDTALA 
WGENDEGQLG NGGTTNTNEP TAVSLPSGTL VTAIAGGDGH SLALTSTGSV LAWGDNSDGQ
LGDGTTTDTT TPVAVDLPTG TEVTAIAAGN DHSLALTSTG SVLAWGDNSD GQLGDGTTTD
TSTPVTVDLP TTTTATAISA GADYSLSLSS TGGAFAWGNN DQGQLGDETT LSTSTPVNVA
LQPGTTLLAV AGGSGHSLAI TSDNAAIAWG DNSQGQLGDG TTTDALAPVN VALAPGTEIT
AVAAGRLHSV ALTSAGTAFT WGNNASGQLG NGTNTTSSTP VAVSLPTGTT LTAIAAHNSN
HTVAITNTET ALAWGDNSFG QLGAEITITS SSNTPIPVNL AAGTTVTTTA VGNNHSLALP
TLQPSSTTNL NVSPPDPTAD QDVTLTATVT CNIDTPTGTI TFRNNNTDLA TVPLDSNNTA
THTTRLPPGT HTLTAHYTST NTCPSGQSES TTITITAPDN PNTPDDPDLP ITGPNLPTIL
GTATLLILAG AAFLFLTRRN RTTHQK