Gene Sare_1610 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1610 
Symbol 
ID5703465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1840742 
End bp1842043 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content65% 
IMG OID641271119 
Productpolymorphic outer membrane protein 
Protein accessionYP_001536494 
Protein GI159037241 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00898059 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACCATC AGCACCACAC CCACGAACTC GAACCGGACC GGGGCCGGGT GAGGTCACGG 
CGGCGGTGGT GGGCTGTCGG GCTCGCCGGG ATGACCGGCC TGGCCCTCAC CACCACTGTC
GGCGTCGCTG CCATCCCAGC CGCCGGCGCG GTCGCACGTA CCCTCACCAC CACCGGCGAC
CACCCCGAGA AGCCGTCCCT TGGCGACCAC CGGGGCAAAA GCGACGGCGA CAAGGGCGAG
GCGAAGCGGG AGACGAAGCC GAAGGGCATC CCGGTCCCCT GTGACCCGGA CCGGCTGATC
GCCGAGATCA ACCTCGCCAA CGCTCGCGGC GGAGCCACCC TCGACCTCGC CAAGAAATGC
GCATACACCC TGACCGCCGA CATCGACGGC GCCGGACTGC CTGCCATCAC CACCCCCATC
ACGCTCAACG GTGGCAAGAG CACCACTATC ACCCGCGCCG CGGCAGCCGA GCCATTCCGC
ATTCTCAACG TCAACGTCAA CGGCCGGCTC ACCCTCAACC ACCTCACCAT CACCGGTGGC
CAACCAGCGC CCGGTGGTCA GGGGGGTGGG ATTCTTGTCA ACCCGGGCGG CGGCGCCACA
ATCAACCACA GTAGAGTCGT CCGTAATATA TCGGGTAACA ATGGTGGTGG AGTTGCGAAT
CTCGGTGGCA GCCTCGAAAT CAGGAACTCC ACTGTCGGTC ATAACACCGC CGCTATTACG
GGAGGTGGGG TCGTCAGCGC TGGGAAACTC GTGGTCGACA AGTCCCGTTT CGATGCCAAC
ACCGCCTCGG CTGCCGGAGG CATTGGCGTC AGTGGTGTAC TGAGTATCAC CCGGAGCGAG
ATTGTCAACC ATCGGGCCGG GACCGGTGGA GGTATGTTCA TCTTCGCCGC GACCGGCACG
ATCAGCGACA CACGCTTCGA AAGGAACACC ACGACGGACG TTGGTGGAGC CGCCATAGGC
GGCGGGCCGC TGCAGCTCAC CCTGTCGCGC GTCACCATCG CCAACAACAC CGCCACCAGT
GGGCTGGGCT TCGGTGCCCT GTTTCTGCAG GGCGGTACCG CCCTGGTCGA GGACAGTGTC
ATCAGGGACA ACATCGGAAC CAACGGCGGC GGCATCCGCA ACCTGGGCAC GTTGACGCTG
ATCCGCACCA AGGTCACCGG CAACCAAGCC ACCGGGTCGG GCGGCGGAGT GTTCAACGAA
TTCATCGGCC GCCTCGCTCT TTTCGCAACC CACATCACCA AGAACACTGC CGGTAGCGAC
GGCGGGGCAT TGTCAACCAG GCGGGTGGCA TCGTCGACCT GA
 
Protein sequence
MNHQHHTHEL EPDRGRVRSR RRWWAVGLAG MTGLALTTTV GVAAIPAAGA VARTLTTTGD 
HPEKPSLGDH RGKSDGDKGE AKRETKPKGI PVPCDPDRLI AEINLANARG GATLDLAKKC
AYTLTADIDG AGLPAITTPI TLNGGKSTTI TRAAAAEPFR ILNVNVNGRL TLNHLTITGG
QPAPGGQGGG ILVNPGGGAT INHSRVVRNI SGNNGGGVAN LGGSLEIRNS TVGHNTAAIT
GGGVVSAGKL VVDKSRFDAN TASAAGGIGV SGVLSITRSE IVNHRAGTGG GMFIFAATGT
ISDTRFERNT TTDVGGAAIG GGPLQLTLSR VTIANNTATS GLGFGALFLQ GGTALVEDSV
IRDNIGTNGG GIRNLGTLTL IRTKVTGNQA TGSGGGVFNE FIGRLALFAT HITKNTAGSD
GGALSTRRVA SST