Gene Sare_4887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4887 
Symbol 
ID5707539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5541710 
End bp5543455 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content69% 
IMG OID641274282 
Productcoagulation factor 5/8 type domain-containing protein 
Protein accessionYP_001539627 
Protein GI159040374 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.825597 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGACAG CTCACCCGAA ACCCCGTCGT CCGCGTCACC GCCTCGCACC AGCGCTTGCG 
ATCCCCCTCA TGTTGATCGC GAGCGCCCTG ACCATCTCCG CCCAGCGTGC CGACGCCGCC
CCCGTCCTGC TGTCGCAGGG CAAGCCGACC ACCGCCTCCT CCGTCGAACT GGACGAGACC
CCACCCGCCG CCGCCACCGA CGGCGACCCG GGCACCCGCT GGTCCAGCGA GTTCGCCGAT
CCGCAGTGGC TCCAGGTCGA CCTGGGCGAC ACCGCCACGA TCAGCCAGGT GACCCTGCGC
TGGGAGGCCG CCTACGCGAG CGCCTTCCAG ATCCAGGTGT CCGTGACCCC CTCCGGACCC
TGGCAGACGA TCTACCAGAC CGCCGCCGGA ACGGGCGGCA CCCAGCATCT CGCGGTCAGC
GGCACCGGGC GCCACGTCCG GGTCCACGGC ACCGCCCGCG GCACCGGCTG GGGCTACTCC
CTGTGGGAGT TCGAGGTCTA CGGCACCATC TCGGCAAGCT GCACGGGCAA TGCCGCGCGT
GGTCGGGACG CTACGGCCTC CTCAGTCGAA ACCGCCGACA CCCCCGCCGC CGCGGCCGTC
GACGGCAACG CCACCACGCG GTGGTCGAGC ACCTTCGCCG ATCCGCAGTG GATTCAGATC
GACCTCGGTG ACGTCGCGAC GATCTGTCAG GTCGTACTTC GGTGGGAGAC GGCCGCCGCG
CGGGCGTTCG ACATCCAGAC AGCCACCAAC CCCAACGGGC CGTGGACCAC CCGCTACGCC
ACCACCACCG GCGCCGGTGG AGTGCAGACG CTCGACGTGT CCGGGACCGG ACGTTACGTG
CGCATGCACG GCACCGCCCG CACCACACCG TACGGCTACT CGCTGTGGGA GTTCGTCGTG
CGTACGTCGG GGTCCGCCCC GCCGCCCGAC GACTTCTGGG GCGACACCAG CTCGATACCG
CCCGCCCAGA ACGTCCTCAC GGTCAAGATC CTGAACCGAA CCAACGGACG CTACCCCGAC
AGCCAGGTGT ACTGGAGCTT CGGCGGCCAG ACCCGGTCCA TCGCGGAACA GCCCTACATC
GACATGCCCG CGAACTCGGC CGGCCGGATG TACTTCCACC TCGGCTCACC GACCAGCCAG
TACCGTGACT TCGTCGAGTT CACCGTTGCC CCGGACCGAT TCAACGGCAA CACCACCCGG
GTAGACGCGT TCGGGCTCAA GCTGGCGATG CGGCTGCGCG CCCACGATGG ATTCGACAAG
GCAGTCGGGG AAACCGAGGC CACCTTCGCC GAGGACCGCG CCACGACCTT CCAGCGGTTC
TCGGACGCGA TGCCGGCCGA GTTCACACAC CTGGCCACGA TCGAGGCGCC GTATCGGGTG
CCGTCCCCGG GCAACACGCC CCAGTTCCGC GCCGGCGGTC AGTACGCCGA CTACCTGAGC
GGGTACGCGT CGTCGGTCGG GATTCCCGCC TCCACCGCCG AGATCTTCGG GTGCTCCGGA
CCGTTGGCCA GCAACCCCGA CGGCTGTGCC GCACTGAACC GTCATGTCGC CCACCTGCCC
CGGTCGCAGT GGGAGAACCC GGCCCTGTTC TACCAACAGG CACCGGCCAA CTACTACGCG
AAGTTCTGGC ATGAGCAGTC CATCGACGGC CTGTCCTACG GTTTCCCCTA CGACGATGTC
GCCGACCAGT CATCGTTCGT CTCAGCCGGT GACCCGCAGT GGTTGATCGT CGCGGTGGGG
TGGTAG
 
Protein sequence
MPTAHPKPRR PRHRLAPALA IPLMLIASAL TISAQRADAA PVLLSQGKPT TASSVELDET 
PPAAATDGDP GTRWSSEFAD PQWLQVDLGD TATISQVTLR WEAAYASAFQ IQVSVTPSGP
WQTIYQTAAG TGGTQHLAVS GTGRHVRVHG TARGTGWGYS LWEFEVYGTI SASCTGNAAR
GRDATASSVE TADTPAAAAV DGNATTRWSS TFADPQWIQI DLGDVATICQ VVLRWETAAA
RAFDIQTATN PNGPWTTRYA TTTGAGGVQT LDVSGTGRYV RMHGTARTTP YGYSLWEFVV
RTSGSAPPPD DFWGDTSSIP PAQNVLTVKI LNRTNGRYPD SQVYWSFGGQ TRSIAEQPYI
DMPANSAGRM YFHLGSPTSQ YRDFVEFTVA PDRFNGNTTR VDAFGLKLAM RLRAHDGFDK
AVGETEATFA EDRATTFQRF SDAMPAEFTH LATIEAPYRV PSPGNTPQFR AGGQYADYLS
GYASSVGIPA STAEIFGCSG PLASNPDGCA ALNRHVAHLP RSQWENPALF YQQAPANYYA
KFWHEQSIDG LSYGFPYDDV ADQSSFVSAG DPQWLIVAVG W