Gene Sare_3975 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3975 
Symbol 
ID5705252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4515671 
End bp4517422 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content69% 
IMG OID641273400 
Productvon Willebrand factor type A 
Protein accessionYP_001538756 
Protein GI159039503 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.734666 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0036434 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGTCTCCAG GCCGCCATCG CATCCGAACG AACGTCCGCG CTGCCGGAGC CGCGGCAGCG 
GCCGGCGTGC TCGCCGTCGC TGCTGGTGGC TACTTCGGCT ACCGCCAGCT CGCCTCACCG
GGCTGCTCAG GCCAGATCGA GTTGGCTGTC GCGGTCGCGT CCGAGCTGGC GCCGGCGGTC
GACACCACGG CGACCGAGTG GGAGAACGAG GGCGCAGTGG TCGGCGGCAC CTGCATCGAG
GTCAACGTCA CGGCCTCCGA CCCGGTCGAG GTAGCCGCCA CCGTCGCGGC CAAGCATGGT
GCCGTCCTGG CCGGGGTGGG CCAGGCCAGC GGCACCGCGA TCAGCCCGGA CGTCTGGGTG
CCCGACTCGT CCGCGTGGCT GCTGCGGCTC AAGACCGGGG GCGCGACCGC GTTCGACCCG
GGTAACAGGG CGTCCATCGC CTACAGCCCG GTGGTCGTGG GGGTGCCGGA GCCGATCGCC
ACCCAGCTTG GCTGGCCAGA GAGCAAGCTC ACCTGGTCAG GGCTGGTCGG CCAGGTCAAC
AACTCCAAGC CGATCAAGGC CGGCACCGTG AATCCGACCC GGGATGCCGC CGGTCTCTCC
GGGCTTCTCG CGCTGAGCGC TGCCGCCGGG GCCGGGGAGA ACGGCCAGGC AGCCACCGTC
GGCGCGTTGC GTGCGCTGTC GACCAACAGT GCGAATCTGC GTCAGGAACT GCTCGCGAAG
TTCCCCACCT CCCCGGATTC CACATCGGTG GCCCGTGGTC TCGGCGCGGC GGCGTTGTCC
GAGGAGGATG TGCTCTCGTA CAACGCCAGG AAGCCGGCGG TGCCGTTGGT GCCGCTCTAC
CTGGAGCCGG CGGCGATGCC GTTGGACTAT CCGTACGCGG TGCTGCCCGG GATCGAGCCA
GCCAAGGCGT CCGCGGCGCA GATGCTGTTT GAGGTGCTCG CCACAGCCAG TTTCAAGGAC
CGGTTGGCGC CGCTGTCGCT GCGTGCGCCG GATGGCACCT GGGGCGCTGG TTTCGGCGCG
CCCCAAGGGG CGCCGAGTCC GGAGGTCGGT GGGGCATCGA CGGAGCCCGG CAGTGGCGAC
GCCGCGGGTG CCGTGGATCC GGTGGCGGTT GACCGGGCGG TCGCCAGCTG GTCGATTGCC
ACCCAGTCTG GCCGGATGCT CTGTGTCATC GATGTCTCCG GCTCGATGAA GGGTTCGGTG
GCGGGCGCCG GCGGTGCCAG CCGTCAGCAG GTCACCCTGG ATGCCGCGCG GCGAGGGCTC
AGCCTGTTCG ACGACAGCTG GCAGATCGGA CTGTGGGAGT TCTCGACGAA TCTCGGCAGC
GGACGGGACT ACCGGCGACT GGTCGAGATC GGCCCGTTGA GCAACCAGCG AAGCAGGCTT
GAGCAGGCGT TGACCCAGAT CCAGCCCACT CGGGGTGACA CCGGCCTGTT TGACACGGTG
CTCGCCGCCT ACGAGGCGGT TCAGGAGGAA TGGGATCCAG GCCAGGTCAA CTCCATCGTG
CTCTTCACCG ACGGTAAGAA CGACGACGAC AACGGCATCA GCCAGCAGCA ACTGCTCGCT
GAACTGGAGC GGATCAAGGA CGCGGAGCGG CCGGTGCAGG TGGTGCTGAT CGGGATCGGC
GCGGATGTCA GCAAGGCAGA GTTGGAGTCG ATCACCAAGG TCACCGGTGG TGGTTCCTTC
GTCACGGAGG ATCCAACCAA GATCGGGGAC ATCTTCCTCA AGGCCATCGC GCTGCGGAAG
CCGGGTGCCT GA
 
Protein sequence
MSPGRHRIRT NVRAAGAAAA AGVLAVAAGG YFGYRQLASP GCSGQIELAV AVASELAPAV 
DTTATEWENE GAVVGGTCIE VNVTASDPVE VAATVAAKHG AVLAGVGQAS GTAISPDVWV
PDSSAWLLRL KTGGATAFDP GNRASIAYSP VVVGVPEPIA TQLGWPESKL TWSGLVGQVN
NSKPIKAGTV NPTRDAAGLS GLLALSAAAG AGENGQAATV GALRALSTNS ANLRQELLAK
FPTSPDSTSV ARGLGAAALS EEDVLSYNAR KPAVPLVPLY LEPAAMPLDY PYAVLPGIEP
AKASAAQMLF EVLATASFKD RLAPLSLRAP DGTWGAGFGA PQGAPSPEVG GASTEPGSGD
AAGAVDPVAV DRAVASWSIA TQSGRMLCVI DVSGSMKGSV AGAGGASRQQ VTLDAARRGL
SLFDDSWQIG LWEFSTNLGS GRDYRRLVEI GPLSNQRSRL EQALTQIQPT RGDTGLFDTV
LAAYEAVQEE WDPGQVNSIV LFTDGKNDDD NGISQQQLLA ELERIKDAER PVQVVLIGIG
ADVSKAELES ITKVTGGGSF VTEDPTKIGD IFLKAIALRK PGA