Gene Sare_2216 details

Gene Information

Locus tag: Sare_2216
Symbol:
ID: 5703897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2551758
End bp: 2553041
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 72%
IMG OID: 641271696
Product: von Willebrand factor type A
Protein accession: YP_001537067
Protein GI: 159037814
COG category:
COG ID:
TIGRFAM ID:
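
The start, end, and length fields above agree with one another. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon counted in the gene length but not in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2551758
end_bp = 2553041

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1284
print(protein_length)  # 427
```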


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.359753
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.122606
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCAGGA CGAGACGATC GGCGGCAGTC CTCGTCGGAC TGCTGGCGAT GAGCGTGATG 
ACGGGCCCGG CACCGGCCCT CGCGGACGGC GAGGCCCCGG TCGAACCGCC GAAGGTCGAG
TTGGTCCTCG ACGTCAGCGG TTCGATGCGG GCCACGGACA TCGACGGGCG AAGCCGGATC
TCGGTGGCCC AGCAGGCGTT CAACGAGGTG GTAGACGCGC TGCCCGACGA GACTCAGCTC
GGCATCCGGG TCCTCGGTGC CACCTACCCG GGTGAGAACA AGGAGCGGGG CTGCCAGGAC
ACCCAGCAGA TCGTACCGGT GGGGCCGGTC GACCGGGTGC AGGCCAAAGC CGCGGTCGCG
ACCCTCCGTC CGACGGGATT CACCCCGGTC GGGCTGGCCC TGCGCTCGGC TGCCCAGGAT
CTCGGCACCG GTAGCACCGC CCGGCGGATC GTGCTGATAA CCGACGGTGA GGACACCTGC
GCCCCTCCCG ACCCCTGCGA GGTGGCCCGG GAACTGGCCG CGCAGGGCAC GAAACTGGTC
GTGGACACCC TTGGCCTGGC GCCGGACGAG AAGGTACGCC GCCAGCTGCT CTGCATCGCC
GCAGCCACCG GTGGCACGTA CACCGCGGCG CAGAGCGCGG ATGAACTGAC CGGCCGGATC
AAGCAACTGG TCGACCGGGC ACGGGACACG CACACCGCCA CGCCGGCAGT GGTCGCCGGC
ACCCCGGCCT GCGCCGACGC GCCGCTGCTT GGCGCCGGGG TCTACAGCGA CCGTGAGAAG
TTCTCGGAGC ACCGTTGGTA CCGGGTACCG GTGCACCCGG GGCAGGAACT GCGCGCCTCG
GTCAGCGTGG CGTTGGACCG GCCGGTCAAC CCCGACCATG CGGTGCTGCT GCGGGCCGTG
GCCACCGACG GTCGGGAGTT GGTGCGTGGC GTGGATGCCG GCAGCGGCCG GACCGACGTG
GTCTCCGCCG GCCTGCGGTG GTCGGCGAGT GAGGAGCCGG AGGACGGGCC GTCCCCGACC
CCGTCGGCCA CGACCGGTGC CGAAGCCACC ATCGTCTGCC TCGTGGTGAG CAATGCCTTC
GCGCCTCAGC CGGGAACCCA GACGTCACCC GGGCTGCCGG TCGAGCTGAC CGTGGACGTG
GTCGCGTCCT CGCCTGCCCC GGCGGCTCCG GATCTGGGTC GTGGCTGGGT GCTGCTCGTC
CTGCTGACCG TGGTCGGCCT GCTGGCCGGA CTGGCGTCCG GGATGCTCAG CCGGTGGTGG
CTGGCGACCT GGAGGGAGAA GTGA
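
GC content is the fraction of bases that are G or C. The sketch below shows one way to compute it from a sequence printed in blocks of ten, as above. The function name `gc_fraction` is hypothetical, and the example uses only the first 60 bases of the gene, so its result need not equal the whole-gene figure reported in the record.

```python
def gc_fraction(blocked_seq: str) -> float:
    """Return the fraction of G/C bases, ignoring whitespace between blocks."""
    # Remove the spaces and line breaks used for display.
    seq = "".join(blocked_seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above (60 bases only, not the full gene).
first_line = "ATGATCAGGA CGAGACGATC GGCGGCAGTC CTCGTCGGAC TGCTGGCGAT GAGCGTGATG"
print(f"{gc_fraction(first_line):.1%}")
```

To reproduce the whole-gene value, pass the full gene sequence instead of the first line.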
 
Protein sequence
MIRTRRSAAV LVGLLAMSVM TGPAPALADG EAPVEPPKVE LVLDVSGSMR ATDIDGRSRI 
SVAQQAFNEV VDALPDETQL GIRVLGATYP GENKERGCQD TQQIVPVGPV DRVQAKAAVA
TLRPTGFTPV GLALRSAAQD LGTGSTARRI VLITDGEDTC APPDPCEVAR ELAAQGTKLV
VDTLGLAPDE KVRRQLLCIA AATGGTYTAA QSADELTGRI KQLVDRARDT HTATPAVVAG
TPACADAPLL GAGVYSDREK FSEHRWYRVP VHPGQELRAS VSVALDRPVN PDHAVLLRAV
ATDGRELVRG VDAGSGRTDV VSAGLRWSAS EEPEDGPSPT PSATTGAEAT IVCLVVSNAF
APQPGTQTSP GLPVELTVDV VASSPAPAAP DLGRGWVLLV LLTVVGLLAG LASGMLSRWW
LATWREK
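
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds a codon table from the standard NCBI amino-acid string, which translation table 11 shares with the standard code for amino-acid assignments (the two tables differ in their permitted start codons). The names `CODON_TABLE` and `translate` are illustrative, and only the first and last lines of the gene sequence are used.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) and their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_seq: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    # Remove the spaces and line breaks used for display.
    seq = "".join(blocked_seq.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


first_line = "ATGATCAGGA CGAGACGATC GGCGGCAGTC CTCGTCGGAC TGCTGGCGAT GAGCGTGATG"
last_line = "CTGGCGACCT GGAGGGAGAA GTGA"

print(translate(first_line))  # MIRTRRSAAVLVGLLAMSVM
print(translate(last_line))   # LATWREK*
```

Both outputs match the start and end of the protein sequence above, with the trailing `*` corresponding to the TGA stop codon.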