Gene Sare_4374 details

Gene Information

Locus tag: Sare_4374
Symbol:
ID: 5705065
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4940912
End bp: 4942291
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 64%
IMG OID: 641273796
Product: polymorphic outer membrane protein
Protein accession: YP_001539146
Protein GI: 159039893
COG category:
COG ID:
TIGRFAM ID: [TIGR01376] Chlamydial polymorphic outer membrane protein repeat
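
The Start bp, End bp, Gene Length, and Protein Length fields above are mutually consistent, which a short check can confirm. This is only a sketch: it assumes the coordinates are 1-based and inclusive, and that the gene ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4940912
end_bp = 4942291
gene_length_bp = 1380
protein_length_aa = 459

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)  # prints: 1380 460 459
```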


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000585185
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAAGCATC AGCACCACAC TCACGAATCC AGAACGGATG GTCCCTGCGG GGGGCGGGCG 
AGGTCACGAT GGTGGGCGGT CGGATTGGCC GGGATGACCG GCTTGGCCCT CACCGCCACT
GTCGCCGTCG CCACCCCTGT CGCTGACGCT GTCGGACGGG CCACCGTCGC CGACGACCGG
CTTGGGAAGC CCGACCGGCC ACAGGGGGGT GATCACCGTG ACGAAGGTAA GGGGGAGGAC
GCAAGCCACG TCAAGAAGCC AACGGGCACA CCTGTCCCCT GCAGCACGGG CCGGCTGATC
GCCGCGATCA CCCTGGCCAA CGCCCGCGGC GGTGCCGTGC TCGACCTCGC CAAGAAGTGC
ACCTACCTGC TCACCGCCGA CATCGACGGC GCCGGTCTAC CCGCGATCAC CACCCCCATC
ACTCTCAACG GCGGCAAACA CACCACCATC AAACGCGCCG CGGCGGCCGA CGACTTCAGA
ATCCTGACCG TGAACGCCAA CGGTCGACTC ACCCTCAATC ATCTCACCAT CACCGGCGGC
AGGCTTTCCG GCGACAATGA CGGCGGCGGG ATTCTTATCA ATTCGGGTGG CGGTGCCACT
GTCGACACCA GTAAAATCGT CGCAAACGTC TCGGTTGACG GCGATGCTGG TGCGATCATG
AATAATGGCG GTGTGCTCGA CATCAGGCAT TCCATCATCA GTCGCAATAC GGCCGCCAAT
ATCGGCGGCG CGATCTTCAG TATTGGTCAA CTCGTCGTCG ACAAGTCACG GTTCGATGCC
AACGCTGCCC TGACTGGCGG TGCCATCACC ATCAGTGGCG ACGTCACCAT AACCCGGAGC
GAGTTGGTCG ACCATCAGGC TGCCGACGGT GGCGCCGTCT TCTTCCTCGG CGGGTCGACC
GGCAAGATCA CCGATACGCG TTTCGCGCGA AACACGGCGA CGAACACCGG CGGATCCGCC
ATCGGCGGGG GCCCTACACA GCTCACCATG TCCCGGGTCA CCCTCGCCAA CAACACCACA
ACCGGTGCCG GCGGGGGCGC ACTGTTTCTA CAAGGCGGAA GCGCGCTCGT GGAGGACAGT
GTCATCAAGA ACAATGTCGG AACAAACGGC GGTGGCATTC GTAATCTTGG CGGGTTGACG
CTGCTCCGCA CACAGGTCAC CGGCAACCAG GCCACCGAGT CGGGCGGCGG AATCCTCAAC
GAGGCAAACG GCGTGCTCGC GCTGCTCAGC ACGAAGGTGG TCAAGAACGT CGCCGGCACC
GACGGCGGCG GCATCTTCAA CGCGGTGGGT GGCACGGTCG ACCTGAACAC CGCCACCGGC
ACCATCGTGG CCAAGAACCG ACCGAACAAC TGCACGAACG TTCCGGACTG CCCGGACTGA
 
Protein sequence
MKHQHHTHES RTDGPCGGRA RSRWWAVGLA GMTGLALTAT VAVATPVADA VGRATVADDR 
LGKPDRPQGG DHRDEGKGED ASHVKKPTGT PVPCSTGRLI AAITLANARG GAVLDLAKKC
TYLLTADIDG AGLPAITTPI TLNGGKHTTI KRAAAADDFR ILTVNANGRL TLNHLTITGG
RLSGDNDGGG ILINSGGGAT VDTSKIVANV SVDGDAGAIM NNGGVLDIRH SIISRNTAAN
IGGAIFSIGQ LVVDKSRFDA NAALTGGAIT ISGDVTITRS ELVDHQAADG GAVFFLGGST
GKITDTRFAR NTATNTGGSA IGGGPTQLTM SRVTLANNTT TGAGGGALFL QGGSALVEDS
VIKNNVGTNG GGIRNLGGLT LLRTQVTGNQ ATESGGGILN EANGVLALLS TKVVKNVAGT
DGGGIFNAVG GTVDLNTATG TIVAKNRPNN CTNVPDCPD
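
The gene and protein sequences above can be related by translating the nucleotide sequence codon by codon. The following is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first 30 bases and the last 9 bases of the gene are used here.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base groups of the gene sequence, as printed in the record.
first_30 = "ATGAAGCATC AGCACCACAC TCACGAATCC"
print(translate(first_30))    # prints: MKHQHHTHES
print(translate("CCGGACTGA")) # last three codons of the gene; prints: PD
print(gc_fraction(first_30))  # prints: 0.5
```

Pasting the full gene sequence into `translate` and `gc_fraction` gives a way to compare against the Protein sequence and GC content fields of the record.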