Gene Sare_4466 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4466 
Symbol 
ID5708341 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5046885 
End bp5048105 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content72% 
IMG OID641273882 
Productcell wall anchor domain-containing protein 
Protein accessionYP_001539231 
Protein GI159039978 
COG category 
COG ID 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.137479 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCCGT TCCACCGTCC GTCGCTGGCC CGTGCCGGGG CGCTCGCCCT GCTCGTAGCG 
GCCGGCAGCA CCGGCGTGTC CGCTCCGGCG CACGCTGCCG GCGAAGCCGA CCTGGCGTTG
ATCCCGCTCA GTTCCGAGCT GGCCAAGGGT GTCGAGGAGG CCAAGGCCAA GCCGTTCAAA
TTTCAGGTCA ACAACACCCG CAGCACGGTC GACGCGAAGG CCGTTCAGGT GACCGTCGAA
ACGGCGCACC TCAACGACCG CAAGGTCGGC GTGGTGGTGC CCGACGGCTG TGAGGCCACC
GGGACCACGT TCAGCTGCCT CCTCGGCGAC CTACCCGCGG GCACCACCGA GGACTTCGGC
ATCCCACTGT TCTCCCTGGG CAAGCGCGGG GACGCCGGGC ACCTGGTCGT CACCGTGACC
TCGGCCACCA CCGACCCGTT CATGGAGGAC AACACCGTCG AGCACGACAT CACCGTCGCC
AAACCCGGCC ACGACCTCAC CACCTGGGTG CAGGACGTGT ACGCCGACGT GGAGGTCGAC
GGCGACGACC GTGGTGAGCA GGCACTGTTG CCGGTGCGGC CGGGCGAGAC CGCGCCGCTG
GACTGGGCCG TGTACAACCA CGGCAGCCGT GCGGCCACCG GCATCGCGTA CGGGATCGCC
CTGCCGGCCG GTGTGACCTT CGCCGAGCTG CCGGAGGGCT GCGACGAGTC GGATGGCGAG
GGCCCGGCGC TGGCGCACTG CGCGGACTCC GGCGCGGTGC TGCGCCCCGG CGAGTTCTAC
ACCGCCGACG TGCGGGTGCG GGTGGACGCC GACGTGACCG AGCCGGTGCT CCGCCCGGGC
TTCCTCTACG GCTTCGGCCT TGACGTCGTG GCCGGCGAGC CGGAGGCGAC GCCGCGAATC
GCCTCCGACA CCCAGCGCCG GACCTTCGCC GATGTCGATC CCGGCGACGA TTGGGCCCAG
TTCGACGTGT TCGTCGACCT CTCCCCGGTC AGCACCCCGA CCCCCACGCC CACCGGGGAA
CCGACCGGTT CGCCGAGCCC GACCGCCACC GCGACCCCGG GCGGGTCCGG TGGTGGCGGC
CTGCCGGTCA CCGGTGTGCA GGCCGGCCTG ATCGGCGGCA TCGGCGCGGC CGTGCTGCTG
GCCGGCGGTG TCCTGCTGCT GCTCTCGCGG CGGCGGAAGG TCGTCCTGGT GAACCCGGCC
GACGAGCGGA CCATCGACTG A
 
Protein sequence
MIPFHRPSLA RAGALALLVA AGSTGVSAPA HAAGEADLAL IPLSSELAKG VEEAKAKPFK 
FQVNNTRSTV DAKAVQVTVE TAHLNDRKVG VVVPDGCEAT GTTFSCLLGD LPAGTTEDFG
IPLFSLGKRG DAGHLVVTVT SATTDPFMED NTVEHDITVA KPGHDLTTWV QDVYADVEVD
GDDRGEQALL PVRPGETAPL DWAVYNHGSR AATGIAYGIA LPAGVTFAEL PEGCDESDGE
GPALAHCADS GAVLRPGEFY TADVRVRVDA DVTEPVLRPG FLYGFGLDVV AGEPEATPRI
ASDTQRRTFA DVDPGDDWAQ FDVFVDLSPV STPTPTPTGE PTGSPSPTAT ATPGGSGGGG
LPVTGVQAGL IGGIGAAVLL AGGVLLLLSR RRKVVLVNPA DERTID