Gene Sare_4422 details

Gene Information

Locus tag: Sare_4422
Symbol:
ID: 5703935
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4996666
End bp: 4997766
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 75%
IMG OID: 641273841
Product: hypothetical protein
Protein accession: YP_001539190
Protein GI: 159039937
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4963] Flp pilus assembly protein, ATPase CpaE
TIGRFAM ID:
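
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(4996666, 4997766)
print(length, protein_length_aa(length))  # 1101 366
```

Both values agree with the Gene Length and Protein Length fields of this record.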


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0164716
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGCCCC GCCCCGCCGT AGTGCCCCCG CACCCCCTGC CCCTGCTCGT GACCTCCGAC 
GATGCTCTGC TGGACGACCT ACTGCGGCTC GCCGCGGCCG GAGGCGTCGA GGTCGACCTC
GCCCCCGATC CGGTGTCGGC CCGTTCCCGC TGGTCACCCG CCCCACTGGT GCTGGTCGGC
AGCGACCAGG CACAGCCGTG TCTGCGGGCG CGGCTGCCGC ATCGGCGGCG GTTGGTGCTG
GTCGGCCGCT CCGGGCAGCT CGACCCCGGC AGGGATGTCG CCGACCTGAT GGGTGCCGAG
TACGTCGCCG TCCTGCCCGC CGCCGAACCC TGGCTGGTGG ACCGGTTCGT CGAGTGCGGC
CCGGATCGAG CCAACCCGGT AGCGGCCCGG GTCGTCGCCG TCCTCGGCGG ACGGGGTGGT
GCCGGTGCGA GTGTGGTCGC TGGCGGGCTC GCCGTCACGG CGGCCCGGTC CCGGCTGCGG
ACACTGCTGG TTGATGCCGA CCCGCTCGGC GGTGGGCTGG ACCTGGTGCT CGGCTGGGAA
CAACAGGCCG GACTGCGCTG GCCTGCGCTG ACCGACGCCG ACGGACGGGT CGACGCGTCG
TCGCTGGTGC GGGCCCTGCC GAGCCGGGGC GACCTGGTGG TCCTGTCCTG GGATCGTGGT
GATCTCCGCT CGTTGCCCTC CCCGGCGATG GCCGCGACCC TCGACGCCGC CCGTCGCGCC
TGTGACCTGG TCGTGGTCGA CCTGCCCCGA CACCTGGACG ACGCGGCGGT GACCGCCCTG
CAGTCGGTCG ACCGGGCTTT CCTCGTGGTA CCCGCCGAAC TCAGGGCGGC GGCGGCTGCC
GCTCGGGTAG TCCGCGCCGC CGCGCCACAC TGCGCCGACC TGTCCCTGAT CATTCGTGGG
CCATCCCCGG GCCGGATCAG GGCCGCCGAG CTTGCGCGAA CGCTTGGGCT GCCGCTGGCC
GGTACGGTGC GTCCGGAGCC GGCGCTCGGG CGCGGCCTGG AACGTGGTGA AGCGCCGGCC
GCGGACGGGC GCGGCCCACT GGCCGCCCTG TGCCAGCGAC TCGTTGGCGA ACTCACCGGC
ACCGCACCGG GCGCGGCATG A
 
Protein sequence
MPPRPAVVPP HPLPLLVTSD DALLDDLLRL AAAGGVEVDL APDPVSARSR WSPAPLVLVG 
SDQAQPCLRA RLPHRRRLVL VGRSGQLDPG RDVADLMGAE YVAVLPAAEP WLVDRFVECG
PDRANPVAAR VVAVLGGRGG AGASVVAGGL AVTAARSRLR TLLVDADPLG GGLDLVLGWE
QQAGLRWPAL TDADGRVDAS SLVRALPSRG DLVVLSWDRG DLRSLPSPAM AATLDAARRA
CDLVVVDLPR HLDDAAVTAL QSVDRAFLVV PAELRAAAAA ARVVRAAAPH CADLSLIIRG
PSPGRIRAAE LARTLGLPLA GTVRPEPALG RGLERGEAPA ADGRGPLAAL CQRLVGELTG
TAPGAA
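
The gene and protein sequences can be cross-checked against each other by translating the nucleotide sequence. The following is a minimal sketch in plain Python, not the method used by the database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons), and it strips the spaces that separate the 10-character blocks. The names `translate` and `gc_fraction` are hypothetical helpers.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nucleotides of the gene sequence listed above.
print(translate("ATGCCGCCCC GCCCCGCCGT AGTGCCCCCG"))  # MPPRPAVVPP
```

Passing the complete gene sequence to `translate` should give the protein sequence shown above, and `gc_fraction` on the same input gives a value to compare with the GC content field.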