Gene Sare_2104 details


Gene Information

Locus tag: Sare_2104
Symbol:
ID: 5704718
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2425310
End bp: 2426431
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 68%
IMG OID: 641271589
Product: abortive infection protein
Protein accession: YP_001536960
Protein GI: 159037707
COG category:
COG ID:
TIGRFAM ID:
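
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched in a few lines. This is only an illustration: the function names `gene_length` and `protein_length` are hypothetical, and the sketch assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the length is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


if __name__ == "__main__":
    length = gene_length(2425310, 2426431)  # Start bp, End bp from the record
    print(length)                  # 1122
    print(protein_length(length))  # 373
```

With the values from this record, the sketch reproduces the listed Gene Length (1122 bp) and Protein Length (373 aa).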


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.574556
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0108203
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCACACC AACCGACCGG CCCCGCCGCG GGCTTGGCGG CACCGGGCCT CGTGCACCGG 
GGGGTCGCGT ACGACACCGG CACGAACTTC GCGACCGGTC AGGGGGCGTT GTCCCGCACC
TGCTGGACCA CCTCGAACAT GCTGTCCGAG ATCAGCCTGA TCAGTGACCA GCTGAACTGC
AACTCGGTGA CCATCTACGG CAGCGACCTC GACCGACTCA CGGCGACCGC CGAGGCCGCC
GTGGCACGGG GTCTGCACGT CCGGTTGCAG CCCCGGCTCG TGGACCGGCC GCAACCGGAC
GTCCTGGAGC ACCTCGCCGA GGCCGCCCGA CTCGCCGAGT CACTACGCCG CCAGGACGCC
CAGGTGAGCC TCACCGTCGG CGCCGTACAC CTGATCTTCA CACCGGGCAT CACCCCCGGA
GACCAGTATC ACGAGCGCAT GGCCAACGTG TACGCGGACG CCAAGCACCA CCTGCTGACC
CCGACGGGGA CGGTGAACAT GGCGACCGCC ACTCCCCGGC TCAACGAGTT CCTCCACCGG
GCGAGCGGCG TCGCCCGTGG GCTGTTCAAC GGCGAACTGG GCTACTCCGC CGCGCTGTTC
GAAGACGTCG ACTGGCAGCT GTTCGACTCG ATCGGACTCA TGTACCAGTA CCTGCCGAGG
TGGCTGCCCA CGGCGGAGGA GCACATCGCG GAGGTGACGC GCTACCACCG GTGGGGCAAG
CCGATCCACA TCGCCGAGTA CGGCACCGCG ACCTACCAGG GCGCCGAGCA GAAGGCGTTC
TTTTTCTGGG ACATCGTCGA CCGCAGTGGG CCGGTCCCCC TCATCCTCGA CGGCTACGTC
CGGGACGAGA GCGAGCAGGC CGCGTACCAC CTGCGCATGC TCGACGCATT CGAGCGGGCG
GGCGTGCACG GTGTCGCGGT CTCGGAGCTG ATCCATCCCA CCCATCCGCA CTCGACCGAC
CCTCGTAAAG ACCTTGACAT GGCAAGCATG GCCATCGTCA AGACCATTCG GGACGACTTC
GCCGATCCGG CCTCCACCTA CCGCTGGGAG CCGAAGGAGT CGTTTCACAC CATCGCCGAC
CACTACGCCC ACATCGGCTT CCAGGCAGCC GCCCGCAGGT GA
 
Protein sequence
MSHQPTGPAA GLAAPGLVHR GVAYDTGTNF ATGQGALSRT CWTTSNMLSE ISLISDQLNC 
NSVTIYGSDL DRLTATAEAA VARGLHVRLQ PRLVDRPQPD VLEHLAEAAR LAESLRRQDA
QVSLTVGAVH LIFTPGITPG DQYHERMANV YADAKHHLLT PTGTVNMATA TPRLNEFLHR
ASGVARGLFN GELGYSAALF EDVDWQLFDS IGLMYQYLPR WLPTAEEHIA EVTRYHRWGK
PIHIAEYGTA TYQGAEQKAF FFWDIVDRSG PVPLILDGYV RDESEQAAYH LRMLDAFERA
GVHGVAVSEL IHPTHPHSTD PRKDLDMASM AIVKTIRDDF ADPASTYRWE PKESFHTIAD
HYAHIGFQAA ARR
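
The relationship between the gene sequence and the protein sequence above can be checked with a short sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate` and `gc_fraction` are hypothetical, not part of any database tool.

```python
from itertools import product

# Codon table in the conventional TCAG ordering used for genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First and last blocks of the gene sequence listed above.
    print(translate("ATGTCACACC AACCGACC"))  # MSHQPT
    print(translate("GCCCGCAGGT GA"))        # ARR
```

Pasting the full gene sequence into `translate` should give the protein sequence listed above, and `gc_fraction` on the same input can be compared with the GC content field.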