Gene Sare_4920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4920 
Symbol 
ID5707408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5585189 
End bp5586595 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content64% 
IMG OID641274314 
Producthypothetical protein 
Protein accessionYP_001539659 
Protein GI159040406 
COG category 
COG ID 
TIGRFAM ID[TIGR01376] Chlamydial polymorphic outer membrane protein repeat 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0390615 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0715167 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGGGA AGGACACCAA CACCGCGATG AACGATCAGC GTTATGACCA CGAACCGGAC 
CGGCCAGGTG CGCGCCGGGC GAGGTCACGG TGGTGGGCGG TTGGACTGGC CGGCATGACC
GGCCTGGCCC TCACCACCAG CCTCGGCATC GGCGGAGCGC CGGCTATCGG CGCCGTCGAC
GGCATTCTCA CTGCCGCCGA TGACCGGCGG GGAAAACCTG ACCGCGCCTC CACCGACAAC
AAGGACCACA AGGGCAAGGG AACAAAGGAC AAACGCAAGG GCACACCGGT CCCGTGTGAC
GCCGATGCGC TGATCGCGGC GATCACCCTC GCCAACGCCC GCGGCGGCGC CGTCCTCGAC
CTCGCCACAG ACTGCACCTA CCTACTCACC GCCACCATCG ACGGTGCCGG CCTGCCCGCC
ATCACCACCC CGATCACCCT CAACGGCGGC AAACACACCA CCATCACCCG CGCCGCCGCC
GCTGAACCGT TCAGAATTCT CACCGTTGAG GCCGGAGGAC ACCTCACTCT CAACCACCTC
ACCATCACCG GGGGCCAGAC CGAAACGTTT GACGACGGTG GGGGGATCCT TGCCAACAGC
GGAAGCACCC TCGCCATCAA CCACAGCGTG ATCCGAAACA ATATCGGCAA CAACGGCGGC
GGAGTGGCCA ACTTCGGCAC GACCACCGTC AAGCACTCCA CGGTTAGCGA GAATACTGCA
CGGGCCAACG CTGGCGGCCT CCAGAATATG GCCGGACTGC TCACCATCGA ACGATCCAAA
ATCACCGACA ACACCGCCCC CGGATTGGCG ATCGGCGGGG GGCTCGGCAG CATCAACGGC
GCGACCACGC GCATAAACCG GAGCAGCATC ACCCACAACC ATTCAGGACT ATCCGGAGGA
GGAATCGGCG ATTTCGACGC CACCACCGTC GTTACCGACT CCACCATCAG CCAGAACACC
GCTGACGTTT CGGGAGGCGG AATCTTCGAG GAGGGGCAAC TCACCCTGCG ACACGTTACG
ATCACTGACA ACAACGCCCT TGATGGTGGC GGTGGGGTCG AAATTCAAAA CGTTCTCGGC
GGGAGCGCCG CGACCATCGA GGACAGCGAA ATCACCAACA ACACGACGGG ACGGGGCGGA
GGGATTCGCA ACCTCGCCGC CACGATCGTG CTCCGAAACA CCCGGATCGC CGGGAACCAG
GCCGACACCG GCGCCGGCGT CTTCAACAAC ATCGGCTCAA CGCTCACCCT TTTCTCCACC
AAGGTCGTCA AGAACACCGC TGTTACCGAC GGTGGGGGCA TCTTCAACGA GGTGGGCGGC
ACGGTGGAGT TGAATACCGC CACCGGCACT GTTGTGGTCA AGAACCGGCC GAACAACTGC
GTCAACGTCA CCGGCTGCCC GGGCTGA
 
Protein sequence
MTGKDTNTAM NDQRYDHEPD RPGARRARSR WWAVGLAGMT GLALTTSLGI GGAPAIGAVD 
GILTAADDRR GKPDRASTDN KDHKGKGTKD KRKGTPVPCD ADALIAAITL ANARGGAVLD
LATDCTYLLT ATIDGAGLPA ITTPITLNGG KHTTITRAAA AEPFRILTVE AGGHLTLNHL
TITGGQTETF DDGGGILANS GSTLAINHSV IRNNIGNNGG GVANFGTTTV KHSTVSENTA
RANAGGLQNM AGLLTIERSK ITDNTAPGLA IGGGLGSING ATTRINRSSI THNHSGLSGG
GIGDFDATTV VTDSTISQNT ADVSGGGIFE EGQLTLRHVT ITDNNALDGG GGVEIQNVLG
GSAATIEDSE ITNNTTGRGG GIRNLAATIV LRNTRIAGNQ ADTGAGVFNN IGSTLTLFST
KVVKNTAVTD GGGIFNEVGG TVELNTATGT VVVKNRPNNC VNVTGCPG