Gene Sare_3830 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3830 
Symbol 
ID5704854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4361985 
End bp4363379 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content71% 
IMG OID641273252 
ProductCBS domain-containing protein 
Protein accessionYP_001538614 
Protein GI159039361 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0304655 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGGCT CTTCCGCCGC AACAACGGTC GGCTCTGCCG GCCTTCCCGA CGTGCAGCTG 
ATCGTGGTCG CGGCCGGGCT GGTGGTCCTC GCCGGCCTGA TCGCGATGAC GGAGGCCGCG
CTCTCCGCCG TCTCTCCGGC ACGCGCCGCC GAACTGGCCC GCGATGGCGC CCGTGGCGCC
CGAGCGTTGC AGTCCGTCGC GAGTGACGTG GTCCGGCACC TCAACCTGCT GCTGCTGTTG
CGGCTGCTCA CCGAGCTGAG CGCGACCACT CTGGTGGCGC TGGTCGCGGT CGACTCGTTC
GGCGCTGGTT GGCGGGCCGC GCTGGTGACG GCCGGGGCGA TGACCGTGGT CAGCTTCGTG
GTGGTCGGCG TCGGGCCGCG CACGATCGGC CGGCAGCATG CCTACGCGGT GGGTCGCGGC
GTGGCGCCGC TGGTGCGTTG GCTGGGTCGG GCGCTCAACC CACTCGCCTC CCTGCTGATC
CTGATCGGCA ACGCGGTCAC CCCGGGGCGG GGCTTCCGGG AGGGGCCCTT CGCCACCCAG
GTGGAGCTGC GCGAACTGGT GGACCTGGCC GAGCAGCGCG GTGTGGTGGA GCATGGCGAG
CGGCAGATGA TCCACTCCGT CTTCGCGCTC GGCGACACCA TCGCCCGCGA GGTGATGGTG
CCGCGTACCG AGATGGTGTG GATCGAGCGG CACAAGATGC TGTCCCAGGC CCTGGCGCTC
TTTCTGCGGT CCGGCTTCTC TCGGATTCCG GTGATCGGCG AGAGCGTCGA CGACGTGCTC
GGCGTGCTCT ATCTGAAGGA TCTGATCCGG CGCACGCAGG GCGGTGCCCC GGAGGACCGA
CGCCTCCCCG TGGCCGAGCT GATGCGTCCG GCCACCTTCG TGCCGGAATC CAAGCCGGTC
GACGACCTGC TCTCGGAGAT GCAGGCCGCC CGGAACCACC TGGTAATCGT CGTTGACGAG
TACGGCGGTA CCGGCGGGTT GGTCACCATC GAGGACATCC TGGAGGAGAT CGTCGGCGAG
ATCACCGACG AGTACGATGT CGAGCGCCCA CCGGTCGAGC GCCTCGACGA CGACGCGGTG
CGGGTCACCG CGCGGCTTCC CGTGGATGAC CTCGGCGAGT TGTTCGACAC CGAGCTGCCC
GGCGACGAGG TGGAGACGGT GGGCGGACTG CTCGCGCAGT CGCTGGGCCG GGTTCCGATC
CCCGGTGCCC AGGTCGAGGT GGCTGGTCTG CGGCTGCTCG CCGAGGGCAC CACCGGCCGG
CGCAACCGGA TCGACACGGT GCTGGTGCGC CGGGTGGAGC CGGCCGACCA GCAGCACGAT
CCGGGTCGCG GTGAACCGAC CGAGACCCGG GACGACACCG ACCAAGCCGA GGAGAGGCAA
CCCGCCGATG CCTGA
 
Protein sequence
MMGSSAATTV GSAGLPDVQL IVVAAGLVVL AGLIAMTEAA LSAVSPARAA ELARDGARGA 
RALQSVASDV VRHLNLLLLL RLLTELSATT LVALVAVDSF GAGWRAALVT AGAMTVVSFV
VVGVGPRTIG RQHAYAVGRG VAPLVRWLGR ALNPLASLLI LIGNAVTPGR GFREGPFATQ
VELRELVDLA EQRGVVEHGE RQMIHSVFAL GDTIAREVMV PRTEMVWIER HKMLSQALAL
FLRSGFSRIP VIGESVDDVL GVLYLKDLIR RTQGGAPEDR RLPVAELMRP ATFVPESKPV
DDLLSEMQAA RNHLVIVVDE YGGTGGLVTI EDILEEIVGE ITDEYDVERP PVERLDDDAV
RVTARLPVDD LGELFDTELP GDEVETVGGL LAQSLGRVPI PGAQVEVAGL RLLAEGTTGR
RNRIDTVLVR RVEPADQQHD PGRGEPTETR DDTDQAEERQ PADA